METH1 and METH2 Polynucleotides and Polypeptides



Cross-Reference to Related Applications 

This application is also a continuation-in-part of U.S. Appl. No. 09/318,208, filed
May 25, 1999, the disclosure of which is incorporated by reference herein; said 09/318,208
is a continuation-in-part of U.S. Appl. No. 09/235,810, filed January 22, 1999, the disclosure
of which is incorporated by reference herein; said 09/235,810 claims priority benefit of U.S.
Appl. No. 60/098,539, filed August 28, 1998, and U.S. Appl. No. 60/072,298, filed January
23, 1998, the disclosures of both of which are incorporated by reference herein. This
application is also a continuation-in-part of U.S. Appl. No. 08/845,496, filed April 24, 1997,
the disclosure of which is incorporated by reference herein. This application also claims
priority benefit of U.S. Appl. No. (Attorney Docket No. 1488.1070005), filed
August 11, 1999 and U.S. Appl. No. 60/144,882, filed July 20, 1999, the disclosures of both
of which are incorporated by reference herein.

Background of the Invention 

Federally-Sponsored Research and Development 

Part of the work performed during development of this invention utilized U.S. 
Government funds. The U.S. Government has certain rights in this invention. 

Field of the Invention 

The present invention relates to novel anti-angiogenic proteins, related to
thrombospondin. More specifically, isolated nucleic acid molecules are provided encoding
human METH1 and METH2 (ME, for metalloprotease, and TH, for thrombospondin).
METH1 and METH2 polypeptides are also provided, as are vectors, host cells and
recombinant methods for producing the same. Also provided are diagnostic methods for the
prognosis of cancer and therapeutic methods for treating individuals in need of an increased
amount of METH1 or METH2. Also provided are methods for inhibiting angiogenesis
using METH1 or METH2.



Related Art 



Angiogenesis, the formation of new blood vessels from pre-existing vasculature, is
a tightly regulated process in normal adults. Under physiological circumstances, growth of
new capillaries is tightly controlled by an interplay of growth regulatory proteins which act
either to stimulate or to inhibit blood vessel growth. Normally, the balance between these
forces is tipped in favor of inhibition and consequently blood vessel growth is restrained.
Under certain pathological circumstances, however, local inhibitory controls are unable to
restrain the increased activity of angiogenic inducers. Angiogenesis is a key step in the
metastasis of cancer (Folkman, Nature Med. 1:27-31 (1995)), and in abnormal wound
healing, inflammation, rheumatoid arthritis, psoriasis, and diabetic retinopathy it is integral
to the pathology (Folkman et al., Science 235:442-447 (1987)), engendering the hope that
these pathological entities could be regulated by pharmacological and/or genetic suppression
of blood vessel growth (Iruela-Arispe et al., Thromb. Haem. 78:672-677 (1997)).

Thrombospondin-1 (TSP-1) is a 450 kDa, anti-angiogenic adhesive glycoprotein
released from activated platelets and secreted by growing cells (reviewed in Adams, Int. J.
Biochem. Cell. Biol. 29:861-865 (1997)). TSP-1 is a homotrimer, with each subunit
comprised of a 1152 amino acid residue polypeptide, post-translationally modified by
N-linked glycosylation and beta-hydroxylation of asparagine residues.

TSP-1 protein and mRNA levels are regulated by a variety of factors. TSP-1 protein
levels are downregulated by IL-1 alpha and TNF alpha. TSP-1 mRNA and protein levels
are upregulated by polypeptide growth factors including PDGF, TGF-beta, and bFGF
(Bornstein, FASEB J. 6:3290-3299 (1992)) and are also regulated by the level of expression
of the p53 tumor suppressor gene product (Dameron et al., Science 265:1582-1584 (1994)).
At least four other members of the thrombospondin family have been identified: TSP-2,
TSP-3, TSP-4, and TSP-5 (also called COMP). There is a need in the art to identify other
molecules involved in the regulation of angiogenesis.

Summary of the Invention

The present invention provides isolated nucleic acid molecules comprising a
polynucleotide encoding the METH1 polypeptide having the amino acid sequence shown
in SEQ ID NO:2 or the amino acid sequence encoded by the cDNA clone deposited in a
bacterial host as ATCC Deposit Number 209581 on January 15, 1998.

The present invention also provides isolated nucleic acid molecules comprising a 
polynucleotide encoding the METH2 polypeptide having the amino acid sequence shown 
in SEQ ID NO:4 or the amino acid sequence encoded by the cDNA clone deposited in a 
bacterial host as ATCC Deposit Number 209582 on January 15, 1998. 

The present invention also relates to recombinant vectors, which include the isolated
nucleic acid molecules of the present invention, and to host cells containing the recombinant
vectors, as well as to methods of making such vectors and host cells and for using them for
production of METH1 or METH2 polypeptides or peptides by recombinant techniques.

The invention further provides an isolated METH1 or METH2 polypeptide having
an amino acid sequence encoded by a polynucleotide described herein.

The invention further provides a diagnostic method useful during diagnosis or
prognosis of cancer.

An additional aspect of the invention is related to a method for treating an individual
in need of an increased level of METH1 or METH2 activity in the body comprising
administering to such an individual a composition comprising a therapeutically effective
amount of an isolated METH1 or METH2 polypeptide of the invention or an agonist thereof.

Brief Description of the Figures 

Figure 1 shows the nucleotide (SEQ ID NO:1) and deduced amino acid (SEQ ID
NO:2) sequences of METH1. The protein has a predicted leader sequence of about 28
amino acid residues (underlined).

Figure 2 shows the nucleotide (SEQ ID NO:3) and deduced amino acid (SEQ ID
NO:4) sequences of METH2. The protein has a predicted leader sequence of about 23
amino acid residues (underlined).

Figure 3 shows a comparison of the amino acid sequence of METH1 (SEQ ID NO:2)
and METH2 (SEQ ID NO:4) with that of their closest homologue, a bovine metalloprotease
(pNPI) (SEQ ID NO:5). Identical amino acids are boxed. Functional domains predicted by
sequence and structural homology are labeled, including the signal peptide (single line), the
potential cleavage site for mammalian subtilisin (double underlined), the zinc-binding site
(dotted line; amino acids 383-395 in METH1 and 363-375 in METH2) in the
metalloprotease domain, and the putative disintegrin loops (arrows).

Figure 4 shows the primary structure of METH1, METH2 and pNPI which includes
a prodomain, a catalytic metalloprotease domain, a cysteine rich disintegrin domain, a
TSP-like domain, a spacer region and a different number of TSP-like domains, three for METH1,
two for METH2, and four for pNPI.

Figure 5 shows a comparison of the TSP-like domain of METH1 (SEQ ID NO:2)
and METH2 (SEQ ID NO:4) with those of TSP1 (SEQ ID NOs:6, 7, and 8) and TSP2 (SEQ
ID NOs:9, 10, and 11). Cysteines are numbered 1 to 6; tryptophans are marked by asterisks.

Figure 6 shows that peptides and recombinant protein derived from the TSP-like
domain of METH1 and METH2 block VEGF-induced angiogenesis. Angiogenesis was
induced on CAMs from 12-14-day-old embryos using a nylon mesh containing VEGF cast
on matrigel and in the presence or absence of the peptides or recombinant protein. Capillary
density was evaluated as described in Example 4. Positive and negative controls included
VEGF alone and vehicle alone, respectively. (A) Quantification of the angiogenic response
induced by VEGF in the presence of recombinant proteins. TSP1, purified platelet TSP1;
GST, purified GST; GST-TSP1, GST-METH1, and GST-METH2 are described in Example
4. (B) Quantification of the angiogenic response induced by VEGF in the presence or
absence of the peptides; P-TSP1, P-METH1, and P-METH2 (peptide derived from the Type
I repeats of TSP, METH1 and METH2, respectively); SC1 and SC2 are scramble peptides
used as controls. (C) Dose-response of the VEGF-induced angiogenesis in the presence of
GST-METH1. (D) Dose-response of the VEGF-induced angiogenesis in the presence of
GST-METH2. The angiogenic index was expressed considering the vascular response from
the VEGF-matrigel as 100% and subtracting the background levels (matrigel alone). Assays
were repeated at least twice. Each treatment was done in triplicate. Values represent the
mean; bars indicate standard deviations. *p<0.001.

Figure 7 shows the effect of METH1 and METH2 recombinant proteins on
bFGF-stimulated cell proliferation. Cells were cultured on 24-well plates in media containing
bFGF and the recombinant protein to be tested (3 µg/ml, unless indicated in the graph).
Controls included vehicle or GST recombinant protein alone. (A), HDEC, human dermal
endothelial cells; (B), HMEC, human mammary epithelial cells; (C), HDF, human dermal
fibroblasts; (D), SMC, smooth muscle cells; (E) Dose-response of GST-METH1 and
GST-METH2 on HDEC proliferation. Experiments were repeated at least twice. Each treatment
was done in triplicate. Values represent the mean; bars indicate standard deviations.
*p<0.01.

Figure 8 shows a schematic representation of the pHE4-5 expression vector (SEQ
ID NO:12) and the subcloned METH1 or METH2 cDNA coding sequence. The locations
of the kanamycin resistance marker gene, the METH1 or METH2 coding sequence, the oriC
sequence, and the lacIq coding sequence are indicated.

Figure 9 shows the nucleotide sequence of the regulatory elements of the pHE
promoter (SEQ ID NO:13). The two lac operator sequences, the Shine-Dalgarno sequence
(S/D), and the terminal HindIII and NdeI restriction sites (italicized) are indicated.

Figure 10 shows an analysis of the METH1 amino acid sequence. Alpha, beta, turn
and coil regions; hydrophilicity and hydrophobicity; amphipathic regions; flexible regions;
antigenic index and surface probability are shown, and all were generated using the default
settings. In the "Antigenic Index or Jameson-Wolf" graph, the positive peaks indicate
locations of the highly antigenic regions of the METH1 or METH2 protein, i.e., regions
from which epitope-bearing peptides of the invention can be obtained. The domains defined
by these graphs are contemplated by the present invention. Tabular representation of the
data summarized graphically in Figure 10 can be found in Table 1.

Figure 11 shows an analysis of the METH2 amino acid sequence. Alpha, beta, turn
and coil regions; hydrophilicity and hydrophobicity; amphipathic regions; flexible regions;
antigenic index and surface probability are shown, and all were generated using the default
settings. In the "Antigenic Index or Jameson-Wolf" graph, the positive peaks indicate
locations of the highly antigenic regions of the METH1 or METH2 protein, i.e., regions
from which epitope-bearing peptides of the invention can be obtained. The domains defined
by these graphs are contemplated by the present invention. Tabular representation of the
data summarized graphically in Figure 11 can be found in Table 2.

Detailed Description of the Preferred Embodiments

By screening cDNA libraries with cDNA encoding the anti-angiogenic domain of
TSP-1, the present inventors have identified two novel proteins, METH1 and METH2 (also
called VEGA-1 and VEGA-2, respectively, for vascular endothelial growth antagonist)
which contain the anti-angiogenic domain of TSP-1, a metalloproteinase domain, and a
disintegrin-like domain. The present inventors have demonstrated that both METH1 and
METH2 have anti-angiogenic activity.

Thus, the present invention provides isolated nucleic acid molecules comprising a
polynucleotide encoding a METH1 polypeptide having the amino acid sequence shown in
SEQ ID NO:2, which was determined by sequencing a cloned cDNA. The METH1 protein
of the present invention shares sequence homology with thrombospondin-1 and pNPI. The
nucleotide sequence shown in SEQ ID NO:1 was obtained by sequencing a cDNA clone,
which was deposited on January 15, 1998 at the American Type Culture Collection, 10801
University Boulevard, Manassas, Virginia 20110-2209, and given accession number 209581.
The cDNA clone contained in ATCC Deposit No. 209581 contains a METH1 sequence,
encoding amino acids 1 to 950 of SEQ ID NO:2.

The present invention also provides isolated nucleic acid molecules comprising a
polynucleotide encoding a METH2 polypeptide having the amino acid sequence shown in
SEQ ID NO:4, which was partially determined by sequencing a cloned cDNA. The METH2
protein of the present invention shares sequence homology with thrombospondin-1 and
pNPI. The nucleotide sequence shown in SEQ ID NO:3 was partially obtained by
sequencing a cDNA clone, which was deposited on January 15, 1998 at the American Type
Culture Collection, 10801 University Boulevard, Manassas, Virginia 20110-2209, and given
accession number 209582. The cDNA clone contained in ATCC Deposit No. 209582
contains a partial METH2 sequence, encoding amino acids 112-890 of SEQ ID NO:4.

Nucleic Acid Molecules 

Some of the nucleotide sequences determined by sequencing a DNA molecule herein
were determined using an automated DNA sequencer (such as the Model 373 from Applied
Biosystems, Inc.), and all amino acid sequences of polypeptides encoded by DNA molecules
determined herein were predicted by translation of a DNA sequence determined as above.
Therefore, as is known in the art for any DNA sequence determined by this automated
approach, any nucleotide sequence determined herein may contain some errors. Nucleotide
sequences determined by automation are typically at least about 90% identical, more
typically at least about 95% to at least about 99.9% identical to the actual nucleotide
sequence of the sequenced DNA molecule. The actual sequence can be more precisely
determined by other approaches including manual DNA sequencing methods well known
in the art. As is also known in the art, a single insertion or deletion in a determined
nucleotide sequence compared to the actual sequence will cause a frame shift in translation
of the nucleotide sequence such that the predicted amino acid sequence encoded by a
determined nucleotide sequence will be completely different from the amino acid sequence
actually encoded by the sequenced DNA molecule, beginning at the point of such an
insertion or deletion.
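
The frame-shift effect described above can be illustrated with a short sketch. The sequence used here is a made-up toy example (it is not taken from SEQ ID NO:1 or SEQ ID NO:3), and the helper name `translate` is hypothetical; the codon table is the standard genetic code.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate complete codons in reading frame 1 ('*' marks a stop codon)."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Toy "actual" sequence (an assumption for illustration only).
actual = "ATGGCTAAAGGTCCTGAA"
# The same sequence with one spurious base inserted after position 6,
# standing in for a sequencing error in the "determined" sequence.
determined = actual[:6] + "C" + actual[6:]

print(translate(actual))      # MAKGPE
print(translate(determined))  # MAQRS*
```

The first two residues agree, and every residue after the inserted base differs, including a premature stop codon in this toy case.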

Using the information provided herein, such as the nucleotide sequence in SEQ ID
NO:1 or SEQ ID NO:3, a nucleic acid molecule of the present invention encoding a
METH1 or METH2 polypeptide may be obtained using standard cloning and screening
procedures, such as those for cloning cDNAs using mRNA as starting material. Illustrative
of the invention, the nucleic acid molecule described in SEQ ID NO:1 was discovered in a
cDNA library derived from human heart and the nucleic acid molecule described in SEQ ID
NO:3 was discovered in a cDNA library derived from human lung. The determined
nucleotide sequence of the METH1 cDNA of SEQ ID NO:1 contains an open reading frame
encoding a protein of about 950 amino acid residues, including a predicted leader sequence
of about 28 amino acid residues. The present inventors have determined that the nucleotide
sequence of the METH2 cDNA of SEQ ID NO:3 contains an open reading frame encoding
a protein of about 890 amino acid residues, including a predicted leader sequence of about
23 amino acid residues.

The present invention also provides the mature form(s) of the METH1 and METH2
proteins of the present invention. According to the signal hypothesis, proteins secreted by
mammalian cells have a signal or secretory leader sequence which is cleaved from the
mature protein once export of the growing protein chain across the rough endoplasmic
reticulum has been initiated. Most mammalian cells and even insect cells cleave secreted
proteins with the same specificity. However, in some cases, cleavage of a secreted protein
is not entirely uniform, which results in two or more mature species of the protein. Further,
it has long been known that the cleavage specificity of a secreted protein is ultimately
determined by the primary structure of the complete protein, that is, it is inherent in the
amino acid sequence of the polypeptide. Therefore, the present invention provides a
nucleotide sequence encoding the mature METH1 polypeptide having the amino acid
sequence encoded by the cDNA clone contained in the host identified as ATCC Deposit No.
209581 and as shown in SEQ ID NO:2. The present invention also provides a nucleotide
sequence encoding the mature METH2 polypeptide having the amino acid sequence as
shown in SEQ ID NO:4. By the mature METH1 protein having the amino acid sequence
encoded by the cDNA clone contained in the host identified as ATCC Deposit No. 209581
is meant the mature form(s) of the METH1 protein produced by expression in a mammalian
cell (e.g., COS cells, as described below) of the complete open reading frame encoded by
the human DNA sequence of the clone contained in the vector in the deposited host. As
indicated below, the mature METH1 having the amino acid sequence encoded by the cDNA
clone contained in ATCC Deposit No. 209581 may or may not differ from the predicted
"mature" METH1 protein shown in SEQ ID NO:2 (amino acids from about 29 to about 950)
depending on the accuracy of the predicted cleavage site based on computer analysis; and
the mature METH2 may or may not differ from the predicted "mature" METH2 protein
shown in SEQ ID NO:4 (amino acids from about 24 to about 890) depending on the
accuracy of the predicted cleavage site based on computer analysis. Additionally, the mature
form of the protein may then undergo even more processing after the prodomain has been
cleaved (e.g., a second cleavage distal to the prodomain, located in the metalloprotease
domain/cysteine-rich region). Thus, "mature" forms of the proteins encompass not only
those forms produced by cleavage of the prodomain, but also other processed forms of the
protein.

Methods for predicting whether a protein has a secretory leader as well as the
cleavage point for that leader sequence are available. For instance, the methods of McGeoch
(Virus Res. 3:271-286 (1985)) and von Heijne (Nucleic Acids Res. 14:4683-4690 (1986)) can
be used. The accuracy of predicting the cleavage points of known mammalian secretory
proteins for each of these methods is in the range of 75-80%. von Heijne, supra. However,
the two methods do not always produce the same predicted cleavage point(s) for a given
protein.

In the present case, the predicted amino acid sequences of the complete METH1 and
METH2 polypeptides of the present invention were analyzed by a computer program
("PSORT") (K. Nakai and M. Kanehisa, Genomics 14:897-911 (1992)), which is an expert
system for predicting the cellular location of a protein based on the amino acid sequence.
As part of this computational prediction of localization, the methods of McGeoch and von
Heijne are incorporated. The analysis by the PSORT program predicted the cleavage site
between amino acids 28 and 29 in SEQ ID NO:2 and amino acids 23 and 24 in SEQ ID
NO:4. Thereafter, the complete amino acid sequences were further analyzed by visual
inspection, applying a simple form of the (-1,-3) rule of von Heijne. von Heijne, supra.
Thus, the leader sequence for the METH1 protein is predicted to consist of amino acid
residues from about 1 to about 28 in SEQ ID NO:2, while the mature METH1 protein is
predicted to consist of residues from about 29 to about 950; and the leader sequence for the
METH2 protein is predicted to consist of amino acid residues from about 1 to about 23 in
SEQ ID NO:4, while the mature METH2 protein is predicted to consist of residues from
about 24 to about 890. An alternative predicted mature METH1 protein consists of residues
30 to 950 in SEQ ID NO:2. Another alternative predicted mature METH1 protein consists
of residues 35 to 950 of SEQ ID NO:2. An alternative predicted mature METH2 protein
consists of residues 31 to 890 of SEQ ID NO:4.
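
As a rough illustration of the bookkeeping above, the following sketch derives the mature-protein residue range from a leader length and applies a simplified form of the (-1,-3) rule. It is not the PSORT program or the published method: the residue set `SMALL_NEUTRAL`, the function names, and the toy sequence are all assumptions made for this example; only the leader lengths (28 and 23) and total lengths (950 and 890) come from the text.

```python
# Simplified set of small, uncharged residues allowed at positions -1 and -3
# (an assumption for this sketch, not the full published rule).
SMALL_NEUTRAL = set("AGSCT")


def satisfies_minus1_minus3(seq, leader_len):
    """Check positions -1 and -3 relative to cleavage after residue leader_len."""
    if leader_len < 3 or leader_len >= len(seq):
        return False
    return (seq[leader_len - 1] in SMALL_NEUTRAL
            and seq[leader_len - 3] in SMALL_NEUTRAL)


def mature_range(leader_len, total_len):
    """1-based, inclusive residue range of the mature protein."""
    return leader_len + 1, total_len


def split_leader(seq, leader_len):
    """Split a sequence into (leader, mature) parts at the cleavage site."""
    return seq[:leader_len], seq[leader_len:]


# Ranges predicted in the text for METH1 and METH2.
print(mature_range(28, 950))  # (29, 950)
print(mature_range(23, 890))  # (24, 890)

# Hypothetical toy precursor, not a METH1 or METH2 sequence.
toy = "MKTLLLALLSVAGAQEPTR"
print(satisfies_minus1_minus3(toy, 14))  # True
print(split_leader(toy, 14))             # ('MKTLLLALLSVAGA', 'QEPTR')
```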

As one of ordinary skill would appreciate, due to the possibilities of sequencing
errors, as well as the variability of cleavage sites for leaders in different known proteins, the
predicted METH1 polypeptide encoded by the deposited cDNA comprises about 950 amino
acids, but may be anywhere in the range of 910-990 amino acids; and the predicted leader
sequence of this protein is about 28 amino acids, but may be anywhere in the range of about
18 to about 38 amino acids. An alternative predicted METH1 polypeptide is shown in SEQ
ID NO:125, and comprises an additional 18 amino acid residues on the N-terminus. Also,
the predicted METH2 polypeptide comprises about 890 amino acids, but may be anywhere
in the range of 850 to about 930 amino acids; and the predicted leader sequence of this
protein is about 23 amino acids, but may be anywhere in the range of about 13 to about 33
amino acids.

As indicated, nucleic acid molecules of the present invention may be in the form of
RNA, such as mRNA, or in the form of DNA, including, for instance, cDNA and genomic
DNA obtained by cloning or produced synthetically. The DNA may be double-stranded or
single-stranded. Single-stranded DNA or RNA may be the coding strand, also known as the
sense strand, or it may be the non-coding strand, also referred to as the anti-sense strand.
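
The relationship between the sense and anti-sense strands can be sketched as a reverse complement. The function name `antisense` and the six-base input are illustrative assumptions, not sequences from this application.

```python
# Watson-Crick base pairing for DNA.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense(sense):
    """Return the anti-sense strand, written 5' to 3', for a sense DNA strand."""
    return sense.translate(COMPLEMENT)[::-1]


print(antisense("ATGGCC"))  # GGCCAT
```

Applying the function twice returns the original strand, which is a quick sanity check on any such helper.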

By "isolated" nucleic acid molecule(s) is intended a nucleic acid molecule, DNA or 
RNA, which has been removed from its native environment. For example, recombinant 
DNA molecules contained in a vector are considered isolated for the purposes of the present 
invention. Further examples of isolated DNA molecules include recombinant DNA 
molecules maintained in heterologous host cells or purified (partially or substantially) DNA 
molecules in solution. Isolated RNA molecules include in vivo or in vitro RNA transcripts 
of the DNA molecules of the present invention. Isolated nucleic acid molecules according 
to the present invention further include such molecules produced synthetically. 

Isolated nucleic acid molecules of the present invention include DNA molecules
comprising an open reading frame (ORF) shown in SEQ ID NO:1; DNA molecules
comprising the coding sequence for the mature METH1 protein; and DNA molecules which
comprise a sequence substantially different from those described above but which, due to
the degeneracy of the genetic code, still encode the METH1 protein. Also included are DNA
molecules comprising an open reading frame (ORF) shown in SEQ ID NO:3; DNA
molecules comprising the coding sequence for the mature METH2 protein; and DNA
molecules which comprise a sequence substantially different from those described above but
which, due to the degeneracy of the genetic code, still encode the METH2 protein. Of
course, the genetic code is well known in the art. Thus, it would be routine for one skilled
in the art to generate such degenerate variants.
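
As a minimal sketch of what generating degenerate variants involves, the code below counts and enumerates the DNA sequences that encode a short peptide under the standard genetic code. The peptides are toy examples and the function names are hypothetical.

```python
from itertools import product
from math import prod

# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")

# Map each amino acid to the list of codons that encode it.
SYNONYMOUS = {}
for i, b1 in enumerate(BASES):
    for j, b2 in enumerate(BASES):
        for k, b3 in enumerate(BASES):
            aa = AMINO_ACIDS[16 * i + 4 * j + k]
            SYNONYMOUS.setdefault(aa, []).append(b1 + b2 + b3)


def count_encodings(peptide):
    """Number of distinct DNA sequences encoding the peptide."""
    return prod(len(SYNONYMOUS[aa]) for aa in peptide)


def encodings(peptide):
    """Enumerate every DNA sequence encoding the peptide (short peptides only)."""
    return ["".join(codons)
            for codons in product(*(SYNONYMOUS[aa] for aa in peptide))]


print(count_encodings("MKL"))   # 12 (1 codon for M, 2 for K, 6 for L)
print(sorted(encodings("MF")))  # ['ATGTTC', 'ATGTTT']
```

The count grows multiplicatively with length, so enumeration is practical only for short peptides; for a full-length protein one would typically choose a single codon per residue instead.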

Polynucleotides of the present invention encompass not only polynucleotides 
encoding the full length sequence, but polynucleotides encoding the mature, proprotein, 
processed forms of the protein, deletion mutants, substitution variants, allelic variants, 
analogs, derivatives, etc. 

In another aspect, the invention provides isolated nucleic acid molecules encoding
the METH1 or METH2 polypeptides having an amino acid sequence as encoded by the
cDNA clones contained in the plasmids deposited as ATCC Deposit No. 209581 on January
15, 1998 or ATCC Deposit No. 209582 on January 15, 1998, respectively. In a further
embodiment, nucleic acid molecules are provided encoding the mature METH1 or METH2
polypeptide or the full-length METH1 or METH2 polypeptide lacking the N-terminal
methionine. The invention also provides an isolated nucleic acid molecule having the
nucleotide sequence shown in SEQ ID NO:1 or SEQ ID NO:3 or the nucleotide sequence
of the METH1 or METH2 cDNA contained in the above-described deposited clones, or a
nucleic acid molecule having a sequence complementary to one of the above sequences.
Such isolated molecules, particularly DNA molecules, are useful as probes for gene
mapping, by in situ hybridization with chromosomes, and for detecting expression of the
METH1 or METH2 gene in human tissue, for instance, by Northern blot analysis.

The present invention is further directed to fragments of the isolated nucleic acid
molecules described herein. By a fragment of an isolated nucleic acid molecule having the
nucleotide sequence of the deposited cDNA or the nucleotide sequence shown in SEQ ID
NO:1 or SEQ ID NO:3 is intended fragments at least about 15 nt, and more preferably at
least about 20 nt, still more preferably at least about 30 nt, and even more preferably, at least
about 40 nt in length which are useful as diagnostic probes and primers as discussed herein.
Of course, larger fragments 50, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650,
700, 750, 800, 850, 900, 950, 1000, 1050, 1100, 1200, 1300, 1400, 1500, 1600, 1700, 1800,
1900, 2000, 2100, 2200, 2300, 2400, 2500, 2600, 2700, 2800, 2900, or 3000 nt in length are
also useful according to the present invention as are fragments corresponding to most, if not
all, of the nucleotide sequence of the deposited cDNA or as shown in SEQ ID NO:1 or SEQ
ID NO:3. By a fragment at least 20 nt in length, for example, is intended fragments which
include 20 or more contiguous bases from the nucleotide sequence of the deposited cDNA
or the nucleotide sequence as shown in SEQ ID NO:1 or SEQ ID NO:3.
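
The "contiguous bases" definition of a fragment can be sketched as follows. The 30-nt reference string is a made-up placeholder rather than any part of SEQ ID NO:1 or SEQ ID NO:3, and the function names are hypothetical.

```python
def fragments(seq, length):
    """All contiguous sub-sequences of exactly `length` bases."""
    return [seq[i:i + length] for i in range(len(seq) - length + 1)]


def is_fragment(candidate, reference, min_len=20):
    """True if candidate has at least min_len bases, all contiguous in reference."""
    return len(candidate) >= min_len and candidate in reference


# Toy 30-nt reference sequence (an assumption for illustration only).
reference = "ATGGCTAAAGGTCCTGAAGATCTGACCGGT"

print(len(fragments(reference, 20)))          # 11, i.e. 30 - 20 + 1
print(is_fragment(reference[5:25], reference))  # True
print(is_fragment(reference[:19], reference))   # False (only 19 nt)
```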

Preferred nucleic acid fragments of the present invention include nucleic acid
molecules encoding epitope-bearing portions of the METH1 or METH2 protein. Methods
for determining epitope-bearing portions of the METH1 and METH2 proteins are described
in detail below.

Other preferred nucleic acid fragments of the present invention include nucleic acid
molecules encoding: the metalloprotease domain of METH1, amino acids 235 to 459 in SEQ
ID NO:2; the disintegrin domain of METH1, amino acids 460 to 544 in SEQ ID NO:2; the
first TSP-like domain of METH1, amino acids 545 to 598 in SEQ ID NO:2; the second
TSP-like domain of METH1, amino acids 841 to 894 in SEQ ID NO:2; the third TSP-like domain
of METH1, amino acids 895 to 934 in SEQ ID NO:2; amino acids 536 to 613 in SEQ ID
NO:2; amino acids 549 to 563 in SEQ ID NO:2; the metalloprotease domain of METH2,
amino acids 214 to 439 in SEQ ID NO:4; the disintegrin domain of METH2, amino acids
440 to 529 in SEQ ID NO:4; the first TSP-like domain of METH2, amino acids 530 to 583
in SEQ ID NO:4; the second TSP-like domain of METH2, amino acids 837 to 890 in SEQ
ID NO:4; amino acids 280 to 606 in SEQ ID NO:4; and amino acids 529 to 548 in SEQ ID
NO:4; and nucleic acid molecules encoding combinations of these domains.
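
The domain boundaries listed above can be collected into a lookup table, as in the sketch below. The residue coordinates are those given in the text; the dictionary layout, the function names, and the `coding_span` convention (nucleotide positions counted from the first base of the start codon, not the numbering of SEQ ID NO:1 or SEQ ID NO:3) are assumptions made for illustration.

```python
# 1-based, inclusive amino acid coordinates as listed in the text.
DOMAINS = {
    "METH1": {
        "metalloprotease": (235, 459),
        "disintegrin": (460, 544),
        "TSP-like 1": (545, 598),
        "TSP-like 2": (841, 894),
        "TSP-like 3": (895, 934),
    },
    "METH2": {
        "metalloprotease": (214, 439),
        "disintegrin": (440, 529),
        "TSP-like 1": (530, 583),
        "TSP-like 2": (837, 890),
    },
}


def domain_length(protein, domain):
    """Number of residues in the named domain."""
    start, end = DOMAINS[protein][domain]
    return end - start + 1


def extract(seq, start, end):
    """Slice a sequence using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


def coding_span(start, end):
    """Nucleotide span encoding residues start..end, relative to the start codon."""
    return 3 * (start - 1) + 1, 3 * end


print(domain_length("METH1", "metalloprotease"))  # 225
print(domain_length("METH2", "metalloprotease"))  # 226
print(coding_span(235, 459))                      # (703, 1377)
print(extract("ABCDEFGHIJ", 3, 5))                # CDE
```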

Thus, preferred embodiments include a nucleic acid molecule encoding a METH1
or METH2 protein lacking the signal sequence (cleavage occurs for METH1 somewhere
about 1-24 to about 1-34 and about 1-23 to about 1-30 for METH2); a METH1 or METH2
protein lacking the signal sequence and the prodomain (cleavage for the prodomain can
occur in METH1 between amino acids about 232 to 236 and in METH2 between amino
acids about 211 to 215); a METH1 or METH2 protein lacking the signal sequence, the
prodomain, and the metalloprotease domain; a METH1 or METH2 protein lacking the signal
sequence, the prodomain, the metalloprotease domain, and the cysteine rich domain; a
METH1 or METH2 protein lacking the signal sequence, the prodomain, the metalloprotease
domain, cysteine rich domain and TSP1; a METH1 or METH2 protein lacking the signal
sequence, the prodomain, the metalloprotease domain, cysteine rich domain, TSP1 and
TSP2. Also preferred are polypeptides encoded by such nucleic acids.

Similarly, preferred embodiments include a nucleic acid encoding a METH1 protein
lacking TSP3; a METH1 protein lacking TSP2 and TSP3; a METH1 protein lacking TSP3,
TSP2, and TSP1; a METH1 protein lacking the cysteine-rich domain, TSP1, TSP2, and
TSP3; a METH1 protein lacking the metalloprotease domain, the cysteine-rich domain,
TSP1, TSP2 and TSP3; and a METH1 protein lacking the prodomain, the metalloprotease
domain, the cysteine-rich domain, TSP1, TSP2, and TSP3; a METH2 protein lacking TSP2;
a METH2 protein lacking TSP1 and TSP2; a METH2 protein lacking the cysteine-rich
domain, TSP1 and TSP2; a METH2 protein lacking the metalloprotease domain, the
cysteine-rich domain, TSP1 and TSP2; and a METH2 protein lacking the prodomain, the
metalloprotease domain, the cysteine-rich domain, TSP1 and TSP2. Also preferred are
polypeptides encoded by such nucleic acids.
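
The two series of deletion constructs described above, successive removal of domains from the N-terminus and from the C-terminus, can be enumerated mechanically. The sketch below does this for METH1 using the domain order given in the text; the spacer region shown in Figure 4 is omitted, as it is in the lists above, and the function names are hypothetical.

```python
# Domain order for METH1 as used in the lists above (spacer region omitted).
METH1_DOMAINS = [
    "signal sequence", "prodomain", "metalloprotease domain",
    "cysteine-rich domain", "TSP1", "TSP2", "TSP3",
]


def n_terminal_truncations(domains):
    """Constructs that lack the first 1, 2, ... domains (at least one remains)."""
    return [domains[i:] for i in range(1, len(domains))]


def c_terminal_truncations(domains):
    """Constructs that lack the last 1, 2, ... domains (at least one remains)."""
    return [domains[:i] for i in range(len(domains) - 1, 0, -1)]


for construct in n_terminal_truncations(METH1_DOMAINS):
    print(" + ".join(construct))
for construct in c_terminal_truncations(METH1_DOMAINS):
    print(" + ".join(construct))
```

Each series yields six constructs for METH1, matching the six N-terminal and six C-terminal deletion variants enumerated in the text.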

Also preferred are nucleic acids encoding any combination of METH1 domains. For
example, nucleic acid molecules encoding polypeptides comprising the following domains
of METH1 are preferred: signal sequence and prodomain; signal sequence, prodomain and
metalloprotease domain; signal sequence and metalloprotease domain; signal sequence,
prodomain, metalloprotease domain, and cysteine rich domain; signal sequence and cysteine
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rich domain; signal sequence, metalloprotease domain and cysteine rich domain; signal 
sequence, prodomain, and cysteine rich domain; signal sequence, prodomain, 
metalloprotease domain, cysteine rich domain, and TSPl ; signal sequence and TSPl ; signal 
sequence, prodomain and TSPl; signal sequence, prodomain, metalloprotease domain and 
5 TSPl; signal sequence, metalloprotease domain, and TSPl; signal sequence, prodomain, 

cysteine rich domain and TSP1; signal sequence, cysteine rich domain and TSP1; signal
sequence, metalloprotease domain, cysteine rich domain and TSP1; signal sequence,
prodomain, metalloprotease domain, cysteine rich domain, TSP1 and TSP2; signal sequence
and TSP2; signal sequence, prodomain and TSP2; signal sequence, metalloprotease domain
and TSP2; signal sequence, cysteine rich domain and TSP2; signal sequence, TSP1 and
TSP2; signal sequence, prodomain, metalloprotease domain and TSP2; signal sequence,
prodomain, cysteine rich domain and TSP2; signal sequence, prodomain, TSP1 and TSP2;
signal sequence, metalloprotease domain, cysteine rich domain and TSP2; signal sequence,
metalloprotease domain, TSP1 and TSP2; signal sequence, metalloprotease domain, cysteine
rich domain, TSP1 and TSP2; signal sequence, prodomain, cysteine rich domain, TSP1 and
TSP2; signal sequence and TSP3; signal sequence, prodomain and TSP3; signal sequence,
prodomain, metalloprotease domain and TSP3; signal sequence, metalloprotease domain and
TSP3; signal sequence, prodomain, metalloprotease domain, cysteine rich domain and TSP3;
signal sequence, cysteine rich domain and TSP3; signal sequence, prodomain, cysteine rich
domain and TSP3; signal sequence, prodomain, metalloprotease domain, cysteine rich
domain, TSP1 and TSP3; signal sequence, TSP1 and TSP3; signal sequence, prodomain,
TSP1 and TSP3; signal sequence, prodomain, metalloprotease domain, TSP1 and TSP3;
signal sequence, prodomain, cysteine rich domain, TSP1 and TSP3; signal sequence, TSP2
and TSP3; signal sequence, prodomain, cysteine rich domain, TSP1, TSP2 and TSP3; signal
sequence, prodomain, metalloprotease domain, TSP1, TSP2 and TSP3; signal sequence,
metalloprotease domain, TSP1, TSP2 and TSP3; signal sequence, TSP1, TSP2 and TSP3;
signal sequence, metalloprotease domain, cysteine rich domain, TSP1, TSP2 and TSP3;
signal sequence, prodomain, metalloprotease domain, TSP1 and TSP2; signal sequence,
prodomain, metalloprotease domain, cysteine rich domain, and TSP2; signal sequence,
prodomain, metalloprotease domain, cysteine rich domain, TSP2 and TSP3; signal sequence,
TSP1, TSP2 and TSP3; signal sequence, cysteine rich domain, TSP1 and TSP2; signal
sequence, cysteine rich domain, TSP1 and TSP3; signal sequence, cysteine rich domain,
TSP2 and TSP3; signal sequence, cysteine rich domain, TSP1, TSP2, and TSP3; signal
sequence, metalloprotease domain, cysteine rich domain, and TSP3; signal sequence,
metalloprotease domain, cysteine rich domain, TSP1 and TSP3; signal sequence,
metalloprotease domain, cysteine rich domain, TSP2 and TSP3; signal sequence,
metalloprotease domain, TSP1 and TSP3; signal sequence, metalloprotease domain, TSP2
and TSP3; signal sequence, prodomain, metalloprotease domain, TSP1 and TSP3; signal
sequence, prodomain, metalloprotease domain, TSP2 and TSP3; prodomain and
metalloprotease domain; prodomain and cysteine rich domain; prodomain and TSP1;
prodomain and TSP2; prodomain and TSP3; prodomain, metalloprotease domain and
cysteine rich domain; prodomain, metalloprotease domain and TSP1; prodomain,
metalloprotease domain and TSP2; prodomain, metalloprotease domain and TSP3;
prodomain, metalloprotease domain, cysteine rich domain and TSP1; prodomain,
metalloprotease domain, cysteine rich domain and TSP2; prodomain, metalloprotease
domain, cysteine rich domain and TSP3; prodomain, cysteine rich domain and TSP1;
prodomain, cysteine rich domain and TSP2; prodomain, cysteine rich domain and TSP3;
prodomain, metalloprotease domain, cysteine rich domain, TSP1 and TSP2; prodomain,
metalloprotease domain, cysteine rich domain, TSP1, TSP2 and TSP3; prodomain, cysteine
rich domain, TSP1 and TSP2; prodomain, metalloprotease domain, TSP1 and TSP2;
prodomain, metalloprotease domain, cysteine rich domain, TSP2 and TSP3; prodomain,
cysteine rich domain, TSP1 and TSP3; prodomain, cysteine rich domain, TSP2 and TSP3;
prodomain, TSP1 and TSP2; prodomain, TSP1 and TSP3; prodomain, TSP2 and TSP3;
prodomain, metalloprotease domain, TSP1 and TSP2; prodomain, metalloprotease domain,
TSP1 and TSP3; prodomain, metalloprotease domain, TSP2 and TSP3; prodomain,
metalloprotease domain, cysteine rich domain, TSP2 and TSP3; prodomain, TSP1 and
TSP2; prodomain, TSP1 and TSP3; prodomain, TSP2 and TSP3; prodomain,
metalloprotease domain, TSP1 and TSP2; prodomain, metalloprotease domain, TSP1 and
TSP3; prodomain, metalloprotease domain, TSP2 and TSP3; prodomain, metalloprotease
domain, cysteine rich domain, TSP1 and TSP3; prodomain, cysteine rich domain, TSP1, TSP2
and TSP3; prodomain, metalloprotease domain, TSP1, TSP2, and TSP3; metalloprotease
domain and cysteine rich domain; metalloprotease domain and TSP1; metalloprotease
domain and TSP2; metalloprotease domain and TSP3; metalloprotease domain, cysteine rich
domain and TSP1; metalloprotease domain, cysteine rich domain and TSP2; metalloprotease
domain, cysteine rich domain and TSP3; metalloprotease domain, cysteine rich domain,
TSP1 and TSP2; metalloprotease domain, cysteine rich domain, TSP1, TSP2 and TSP3;
metalloprotease domain, cysteine rich domain, TSP1 and TSP3; metalloprotease domain,
cysteine rich domain, TSP2 and TSP3; metalloprotease domain, TSP1 and TSP2;
metalloprotease domain, TSP1 and TSP3; metalloprotease domain, TSP2 and TSP3;
metalloprotease domain, TSP1, TSP2 and TSP3; cysteine rich domain and TSP1; cysteine
rich domain and TSP2; cysteine rich domain and TSP3; cysteine rich domain, TSP1 and
TSP2; cysteine rich domain, TSP1 and TSP3; cysteine rich domain, TSP2 and TSP3;
cysteine rich domain, TSP1, TSP2 and TSP3; TSP1 and TSP2; TSP1 and TSP3; TSP2 and
TSP3; and/or TSP1, TSP2 and TSP3. These domains may be present in the METH1
molecule in the same order or a different order than in the naturally occurring molecule.
Also preferred are polypeptides encoded by such nucleic acids.

Also preferred are nucleic acids encoding any combination of METH2 domains. For
example, nucleic acid molecules encoding polypeptides comprising the following domains
of METH2 are preferred: signal sequence and prodomain; signal sequence, prodomain and
metalloprotease domain; signal sequence and metalloprotease domain; signal sequence,
prodomain, metalloprotease domain, and cysteine rich domain; signal sequence and cysteine
rich domain; signal sequence, metalloprotease domain and cysteine rich domain; signal
sequence, prodomain, and cysteine rich domain; signal sequence, prodomain,
metalloprotease domain, cysteine rich domain, and TSP1; signal sequence and TSP1; signal
sequence, prodomain and TSP1; signal sequence, prodomain, metalloprotease domain and
TSP1; signal sequence, metalloprotease domain, and TSP1; signal sequence, prodomain,
cysteine rich domain and TSP1; signal sequence, cysteine rich domain and TSP1; signal
sequence, metalloprotease domain, cysteine rich domain and TSP1; signal sequence,
prodomain, metalloprotease domain, cysteine rich domain, TSP1 and TSP2; signal sequence
and TSP2; signal sequence, prodomain and TSP2; signal sequence, metalloprotease domain
and TSP2; signal sequence, cysteine rich domain and TSP2; signal sequence, TSP1 and
TSP2; signal sequence, prodomain, metalloprotease domain and TSP2; signal sequence,
prodomain, cysteine rich domain and TSP2; signal sequence, prodomain, TSP1 and TSP2;
signal sequence, metalloprotease domain, cysteine rich domain and TSP2; signal sequence,
metalloprotease domain, TSP1 and TSP2; signal sequence, metalloprotease domain, cysteine
rich domain, TSP1 and TSP2; signal sequence, prodomain, cysteine rich domain, TSP1 and
TSP2; signal sequence, prodomain, metalloprotease domain, TSP1 and TSP2; signal
sequence, prodomain, metalloprotease domain, cysteine rich domain, and TSP2; signal
sequence, cysteine rich domain, TSP1 and TSP2; prodomain and metalloprotease domain;
prodomain and cysteine rich domain; prodomain and TSP1; prodomain and TSP2;
prodomain, metalloprotease domain and cysteine rich domain; prodomain, metalloprotease
domain and TSP1; prodomain, metalloprotease domain and TSP2; prodomain,
metalloprotease domain, cysteine rich domain and TSP1; prodomain, metalloprotease
domain, cysteine rich domain and TSP2; prodomain, cysteine rich domain and TSP1;
prodomain, cysteine rich domain and TSP2; prodomain, metalloprotease domain, cysteine
rich domain, TSP1 and TSP2; prodomain, cysteine rich domain, TSP1 and TSP2;
prodomain, metalloprotease domain, TSP1 and TSP2; prodomain, metalloprotease domain,
TSP1 and TSP2; prodomain, TSP1 and TSP2; prodomain, metalloprotease domain, TSP1
and TSP2; metalloprotease domain and cysteine rich domain; metalloprotease domain and
TSP1; metalloprotease domain and TSP2; metalloprotease domain, cysteine rich domain and
TSP1; metalloprotease domain, cysteine rich domain and TSP2; metalloprotease domain,
cysteine rich domain, TSP1 and TSP2; metalloprotease domain, TSP1 and TSP2; cysteine
rich domain and TSP1; cysteine rich domain and TSP2; cysteine rich domain, TSP1 and
TSP2. These domains may be present in the METH2 molecule in the same order or a
different order than in the naturally occurring molecule. Also preferred are polypeptides
encoded by such nucleic acids.
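
Domain combinations of this kind can also be generated programmatically. The following is a minimal sketch, assuming the seven METH1 domains and the six METH2 domains named in the lists above. The names METH1_DOMAINS, METH2_DOMAINS and domain_combinations are hypothetical and introduced only for illustration. The sketch enumerates every set of two or more domains in their natural order; it is not necessarily identical to the lists recited above, and it does not cover re-ordered arrangements.

```python
from itertools import combinations

# Domain names as used in the lists above (illustrative constants).
METH1_DOMAINS = [
    "signal sequence", "prodomain", "metalloprotease domain",
    "cysteine rich domain", "TSP1", "TSP2", "TSP3",
]
METH2_DOMAINS = METH1_DOMAINS[:-1]  # assumption: METH2 lists name no TSP3


def domain_combinations(domains, min_size=2):
    """Yield every combination of at least min_size domains, in natural order."""
    for size in range(min_size, len(domains) + 1):
        for combo in combinations(domains, size):
            yield combo


if __name__ == "__main__":
    for combo in domain_combinations(METH2_DOMAINS):
        print("; ".join(combo))
```

Under these assumptions the enumeration yields 120 combinations for seven domains and 57 for six.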

Additionally, METH1 and METH2 domains may be combined to form hybrid
molecules. Any domain of METH1 may be combined with any domain of METH2 to form
a hybrid molecule. For example, the TSP1 domain of METH1 may be replaced with the
TSP1 domain of METH2 to form a hybrid molecule, leaving the remainder of the METH1
molecule intact. Also, the TSP1 domain of METH1 may be replaced with the TSP2 domain
of METH2 to form a hybrid molecule, leaving the remainder of the METH1 molecule intact.
Additionally, the TSP1 domain of METH1 may be combined with the TSP2 domain of
METH2 to form a hybrid molecule, without any additional METH1 and/or METH2
sequences. These domains may be present in the same or a different order as occurs in the
naturally occurring molecules. Also preferred are polypeptides encoded by such nucleic
acids.

Further embodiments include nucleic acids encoding a METH1 or METH2
polypeptide in which: one or more TSP domains have been replaced with other known TSP
domains; the metalloprotease domain has been replaced with another known metalloprotease
domain; or the disintegrin domain has been replaced with another known disintegrin domain.
One or more domains may be replaced in this manner. For example, both the
metalloprotease and disintegrin domains may be replaced. Alternatively, all three TSP
domains may be replaced. Also preferred are polypeptides encoded by such nucleic acids.

Preferred embodiments are polynucleotides encoding the amino acid sequence of
SEQ ID NO:2 or SEQ ID NO:4 except that several, 5-10, 1-5, 1-3, 1-2 or 1 amino acid
residues are substituted, deleted or added, in any combination.

In addition, the present inventors have identified the following cDNA clones related
to portions of the sequence shown in SEQ ID NO:1: HOUCQ17RA (SEQ ID NO:14),
HPLBMl IR (SEQ ID NO:15), HGBI07R (SEQ ID NO:16), HNTMA49R (SEQ ID NO:17),
HNALE27R (SEQ ID NO:18), and HIBDB45R (SEQ ID NO:19).

The following public ESTs, which relate to portions of SEQ ID NO:1, have also been
identified: D67076 (SEQ ID NO:20), AB001735 (SEQ ID NO:21), X14787 (SEQ ID
NO:22), U64857 (SEQ ID NO:23), X04665 (SEQ ID NO:24), M64866 (SEQ ID NO:25),
L07803 (SEQ ID NO:26), U08006 (SEQ ID NO:27), M16974 (SEQ ID NO:28), L13855
(SEQ ID NO:29), AL021529 (SEQ ID NO:30), D86074 (SEQ ID NO:31), L05390 (SEQ ID
NO:32), Z69361 (SEQ ID NO:33), X99599 (SEQ ID NO:34), AF018073 (SEQ ID NO:35),
L23760 (SEQ ID NO:36), Z46970 (SEQ ID NO:37), AC004449 (SEQ ID NO:38), Z69589
(SEQ ID NO:39), Z22279 (SEQ ID NO:40), X17524 (SEQ ID NO:41), AI126Q19 (SEQ ID
NO:103), AI571069 (SEQ ID NO:104), AI148739 (SEQ ID NO:105), AI335849 (SEQ ID
NO:106), AA677116 (SEQ ID NO:107), H27128 (SEQ ID NO:108), AA368429 (SEQ ID
NO:109), AA345812 (SEQ ID NO:110), AA373718 (SEQ ID NO:111), AI537518 (SEQ ID
NO:112), N88341 (SEQ ID NO:113), C03600 (SEQ ID NO:114), AA066142 (SEQ ID
NO:115), AI400905 (SEQ ID NO:94), AA288689 (SEQ ID NO:116), AI464076 (SEQ ID
NO:97), R13547 (SEQ ID NO:117), R19976 (SEQ ID NO:118), Z43925 (SEQ ID NO:119),
AA670987 (SEQ ID NO:120), AA635657 (SEQ ID NO:96), W24878 (SEQ ID NO:121),
W47316 (SEQ ID NO:122), W35345 (SEQ ID NO:123), and N27243 (SEQ ID NO:124).

The present inventors have also identified the following cDNA clones related to
portions of SEQ ID NO:3: HCE4D69FP02 (SEQ ID NO:42), HIBDB45F (SEQ ID NO:43),
HKIXH64R (SEQ ID NO:44), HIBDB45R (SEQ ID NO:19), HCE3Z95R (SEQ ID NO:45),
HTLEQ90R (SEQ ID NO:46), HMWEF45R (SEQ ID NO:47), HTOFC34RA (SEQ ID
NO:48), HHFDI20R (SEQ ID NO:49), HMSHY47R (SEQ ID NO:50), HCESF90R (SEQ
ID NO:51), HMCA046R (SEQ ID NO:52), HTTAQ67R (SEQ ID NO:53), HFKCF19F
(SEQ ID NO:54), HMCAS31R (SEQ ID NO:55), HMWGP26R (SEQ ID NO:56),
HLHTP36R (SEQ ID NO:57), HE8AN11R (SEQ ID NO:58), HEONN73R (SEQ ID
NO:59), HBNBG53R (SEQ ID NO:60), and HMSCH94R (SEQ ID NO:61).

The following public ESTs, which relate to portions of the sequence shown in SEQ
ID NO:3, have also been identified: D67076 (SEQ ID NO:20), AB001735 (SEQ ID NO:21),
AB005287 (SEQ ID NO:62), X87619 (SEQ ID NO:63), X14787 (SEQ ID NO:22), X04665
(SEQ ID NO:24), M87276 (SEQ ID NO:64), M62458 (SEQ ID NO:65), AB002364 (SEQ
ID NO:66), AB005297 (SEQ ID NO:67), X69161 (SEQ ID NO:68), X16619 (SEQ ID
NO:69), 136448 (SEQ ID NO:70), L12260 (SEQ ID NO:71), 136352 (SEQ ID NO:72),
X15898 (SEQ ID NO:73), 107789 (SEQ ID NO:74), 108144 (SEQ ID NO:75), U31814 (SEQ
ID NO:76), AF001444 (SEQ ID NO:77), AI400905 (SEQ ID NO:94), AI378857 (SEQ ID
NO:95), AA635657 (SEQ ID NO:96), AI464076 (SEQ ID NO:97), C06578 (SEQ ID
NO:98), AA855532 (SEQ ID NO:99), H11881 (SEQ ID NO:100), AA350801 (SEQ ID
NO:101), and AA350802 (SEQ ID NO:102).

In specific embodiments, the polynucleotides of the invention are less than 300 kb,
200 kb, 100 kb, 50 kb, 15 kb, 10 kb, or 7.5 kb in length. In a further embodiment,
polynucleotides of the invention comprise at least 15 contiguous nucleotides of METH1 or
METH2 coding sequence, but do not comprise all or a portion of any METH1 or METH2
intron. In another embodiment, the nucleic acid comprising METH1 or METH2 coding
sequence does not contain coding sequences of a genomic flanking gene (i.e., 5' or 3' to the
METH1 or METH2 gene in the genome).

In another aspect, the invention provides an isolated nucleic acid molecule
comprising a polynucleotide which hybridizes under stringent hybridization conditions to
a portion of the polynucleotide in a nucleic acid molecule of the invention described above,
for instance, the cDNA clones contained in ATCC Deposit No. 209581 or ATCC Deposit
No. 209582. By "stringent hybridization conditions" is intended overnight incubation at
42°C in a solution comprising: 50% formamide, 5x SSC (750 mM NaCl, 75 mM trisodium
citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's solution, 10% dextran sulfate,
and 20 µg/ml denatured, sheared salmon sperm DNA, followed by washing the filters in
0.1x SSC at about 65°C.
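
The SSC concentrations implied by these conditions follow by simple scaling. The sketch below assumes only the definition given above (5x SSC = 750 mM NaCl, 75 mM trisodium citrate) and scales it linearly to other strengths, such as the 0.1x SSC wash. The function name ssc_millimolar is hypothetical.

```python
# 1x SSC derived from the stated 5x composition (750 mM NaCl, 75 mM citrate).
SSC_1X_MM = {"NaCl": 750 / 5, "trisodium citrate": 75 / 5}


def ssc_millimolar(strength):
    """Return component concentrations (mM) for SSC at the given x-strength."""
    return {name: conc * strength for name, conc in SSC_1X_MM.items()}


if __name__ == "__main__":
    for strength in (5, 1, 0.1):
        conc = ssc_millimolar(strength)
        print(f"{strength}x SSC: " + ", ".join(
            f"{value:.1f} mM {name}" for name, value in conc.items()))
```

On this scaling, the 0.1x SSC wash corresponds to roughly 15 mM NaCl and 1.5 mM trisodium citrate.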

By a polynucleotide which hybridizes to a "portion" of a polynucleotide is intended
a polynucleotide (either DNA or RNA) hybridizing to at least about 15 nucleotides (nt), and
more preferably at least about 20 nt, still more preferably at least about 30 nt, and even more
preferably about 30, 40, 50, 60 or 70 nt of the reference polynucleotide. These are useful
as diagnostic probes and primers as discussed above and in more detail below.

By a portion of a polynucleotide of "at least 20 nt in length," for example, is intended
20 or more contiguous nucleotides from the nucleotide sequence of the reference
polynucleotide (e.g., the deposited cDNAs or the nucleotide sequence as shown in SEQ ID
NO:1 or SEQ ID NO:3). Of course, a polynucleotide which hybridizes only to a poly A
sequence (such as the 3' terminal poly(A) tract of the METH1 or METH2 cDNA shown in
SEQ ID NO:1 and SEQ ID NO:3, respectively) or to a complementary stretch of T (or U)
residues, would not be included in a polynucleotide of the invention used to hybridize to a
portion of a nucleic acid of the invention, since such a polynucleotide would hybridize to
any nucleic acid molecule containing a poly(A) stretch or the complement thereof (e.g.,
practically any double-stranded cDNA clone).
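
The exclusion of poly(A)-only probes can be illustrated with a simple check. This is a minimal sketch, assuming a probe is supplied as a plain string of bases; the function name is_homopolymer_tail_probe is hypothetical. It flags only sequences consisting entirely of A, or entirely of T and/or U, and is not a hybridization model.

```python
def is_homopolymer_tail_probe(sequence):
    """True if the probe is only A residues, or only T/U residues."""
    bases = set(sequence.upper())
    if not bases:
        return False  # an empty string is not treated as a probe
    return bases <= {"A"} or bases <= {"T", "U"}


if __name__ == "__main__":
    for probe in ("AAAAAAAAAAAAAAAAAAAA", "TTTTTTTTTTTTTTT", "ATGGCTAAAAAAAAA"):
        print(probe, is_homopolymer_tail_probe(probe))
```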

Also contemplated are nucleic acid molecules that hybridize to the METH1 or
METH2 polynucleotides at moderately high stringency hybridization conditions. Changes
in the stringency of hybridization and signal detection are primarily accomplished through
the manipulation of formamide concentration (lower percentages of formamide result in
lowered stringency), salt conditions, or temperature. For example, moderately high
stringency conditions include an overnight incubation at 37°C in a solution comprising 6X
SSPE (20X SSPE = 3M NaCl; 0.2M NaH2PO4; 0.02M EDTA, pH 7.4), 0.5% SDS, 30%
formamide, 100 µg/ml salmon sperm blocking DNA; followed by washes at 50°C with
1X SSPE, 0.1% SDS. In addition, to achieve even lower stringency, washes performed
following stringent hybridization can be done at higher salt concentrations (e.g. 5X SSC).

Note that variations in the above conditions may be accomplished through the
inclusion and/or substitution of alternate blocking reagents used to suppress background in
hybridization experiments. Typical blocking reagents include Denhardt's reagent,
BLOTTO, heparin, denatured salmon sperm DNA, and commercially available proprietary
formulations. The inclusion of specific blocking reagents may require modification of the
hybridization conditions described above, due to problems with compatibility.

Of course, a polynucleotide which hybridizes only to polyA+ sequences (such as any
3' terminal polyA+ tract of a cDNA shown in the sequence listing), or to a complementary
stretch of T (or U) residues, would not be included in the definition of "polynucleotide,"
since such a polynucleotide would hybridize to any nucleic acid molecule containing a
poly(A) stretch or the complement thereof (e.g., practically any double-stranded cDNA clone).

The METH1 or METH2 polynucleotide can be composed of any polyribonucleotide
or polydeoxyribonucleotide, which may be unmodified RNA or DNA or modified RNA or
DNA. For example, METH1 or METH2 polynucleotides can be composed of single- and
double-stranded DNA, DNA that is a mixture of single- and double-stranded regions, single-
and double-stranded RNA, and RNA that is a mixture of single- and double-stranded regions,
hybrid molecules comprising DNA and RNA that may be single-stranded or, more typically,
double-stranded or a mixture of single- and double-stranded regions. In addition, the
METH1 or METH2 polynucleotides can be composed of triple-stranded regions comprising
RNA or DNA or both RNA and DNA. METH1 or METH2 polynucleotides may also
contain one or more modified bases or DNA or RNA backbones modified for stability or for
other reasons. "Modified" bases include, for example, tritylated bases and unusual bases
such as inosine. A variety of modifications can be made to DNA and RNA; thus,
"polynucleotide" embraces chemically, enzymatically, or metabolically modified forms.

"SEQ ID NO:1" refers to a METH1 polynucleotide sequence while "SEQ ID NO:2"
refers to a METH1 polypeptide sequence. "SEQ ID NO:3" refers to a METH2
polynucleotide sequence while "SEQ ID NO:4" refers to a METH2 polypeptide sequence.

As indicated, nucleic acid molecules of the present invention which encode a
METH1 or METH2 polypeptide may include, but are not limited to, those encoding the
amino acid sequence of the mature polypeptide, by itself; the coding sequence for the mature
polypeptide and additional sequences, such as those encoding the leader or secretory
sequence, such as a pre-, or pro- or prepro- protein sequence; the coding sequence of the
mature polypeptide, with or without the aforementioned additional coding sequences,
together with additional, non-coding sequences, including for example, but not limited to
introns and non-coding 5' and 3' sequences, such as the transcribed, non-translated
sequences that play a role in transcription, mRNA processing (including splicing and
polyadenylation signals, for example), ribosome binding and stability of mRNA; an
additional coding sequence which codes for additional amino acids, such as those which
provide additional functionalities. Thus, the sequence encoding the polypeptide may be
fused to a marker sequence, such as a sequence encoding a peptide which facilitates
purification of the fused polypeptide. In certain preferred embodiments of this aspect of the
invention, the marker amino acid sequence is a hexa-histidine peptide, such as the tag
provided in a pQE vector (Qiagen, Inc.), among others, many of which are commercially
available. As described in Gentz et al., Proc. Natl. Acad. Sci. USA 86:821-824 (1989), for
instance, hexa-histidine provides for convenient purification of the fusion protein. The
"HA" tag is another peptide useful for purification which corresponds to an epitope derived
from the influenza hemagglutinin protein, which has been described by Wilson et al., Cell
37:767-778 (1984). As discussed below, other such fusion proteins include METH1 or
METH2 fused to Fc at the N- or C-terminus. Other fusion proteins include METH1 or
METH2 fused to Flag at the N- or C-terminus. Other fusion proteins include METH1
fragments or METH2 fragments fused to Flag or Fc at the N- or C-terminus. Particularly
preferred are fragments of METH1 or METH2, such as H541-Q894, M1-P799, F236-E614,
or K801-Q950 of SEQ ID NO:2, fused to Fc or Flag at the N- or C-terminus.

As stated above, METH1 or METH2 may be fused with the FLAG polypeptide
sequence (see U.S. Pat. No. 4,851,341; see also Hopp et al., Bio/Technology 6:1204, 1988).
The FLAG polypeptide sequence is highly antigenic and provides an epitope for binding by
a specific monoclonal antibody, enabling rapid purification of the expressed recombinant
protein. This sequence is also specifically cleaved by bovine mucosal enterokinase at the
residue immediately following the Asp-Lys pairing.

The present invention further relates to variants of the nucleic acid molecules of the
present invention, which encode portions, analogs or derivatives of the METH1 or METH2
protein. Variants may occur naturally, such as a natural allelic variant. By an "allelic
variant" is intended one of several alternate forms of a gene occupying a given locus on a
chromosome of an organism (Lewin, B., ed., Genes II, John Wiley & Sons, New York
(1985)). Non-naturally occurring variants may be produced using art-known mutagenesis
techniques.

Such variants include those produced by nucleotide substitutions, deletions or
additions, which may involve one or more nucleotides. The variants may be altered in
coding regions, non-coding regions, or both. Alterations in the coding regions may produce
conservative or non-conservative amino acid substitutions, deletions or additions.
Especially preferred among these are silent substitutions, additions and deletions, which do
not alter the properties and activities of the METH1 or METH2 protein or portions thereof.
Also especially preferred in this regard are conservative substitutions.

Further embodiments of the invention include isolated nucleic acid molecules
comprising a polynucleotide having a nucleotide sequence at least 95% identical, and more
preferably at least 96%, 97%, 98% or 99% identical to: a nucleotide sequence encoding the
polypeptide having the amino acid sequence in SEQ ID NO:2; a nucleotide sequence
encoding the polypeptide having the amino acid sequence in SEQ ID NO:2, but lacking the
N-terminal methionine; a nucleotide sequence encoding the polypeptide having the amino
acid sequence at positions from about 29 to about 950 in SEQ ID NO:2; a nucleotide
sequence encoding the polypeptide having the amino acid sequence at positions from about
30 to about 950 in SEQ ID NO:2; a nucleotide sequence encoding the polypeptide having
the amino acid sequence encoded by the cDNA clone contained in ATCC Deposit No.
209581; a nucleotide sequence encoding the mature METH1 polypeptide having the amino
acid sequence encoded by the cDNA clone contained in ATCC Deposit No. 209581; a
nucleotide sequence encoding amino acids 235 to 459 in SEQ ID NO:2 (the metalloprotease
domain of METH1); a nucleotide sequence encoding amino acids 460 to 544 in SEQ ID
NO:2 (the disintegrin domain of METH1); a nucleotide sequence encoding amino acids 545
to 598 in SEQ ID NO:2 (the first TSP-like domain of METH1); a nucleotide sequence
encoding amino acids 841 to 894 in SEQ ID NO:2 (the second TSP-like domain of
METH1); a nucleotide sequence encoding amino acids 895 to 934 in SEQ ID NO:2 (the
third TSP-like domain of METH1); a nucleotide sequence encoding amino acids 536 to 613
in SEQ ID NO:2; a nucleotide sequence encoding amino acids 549 to 563 in SEQ ID NO:2;
a nucleotide sequence encoding the polypeptide having the amino acid sequence in SEQ ID
NO:4; a nucleotide sequence encoding the polypeptide having the amino acid sequence in
SEQ ID NO:4, but lacking the N-terminal methionine; a nucleotide sequence encoding the
polypeptide having the amino acid sequence at positions from about 24 to about 890 in SEQ
ID NO:4; a nucleotide sequence encoding the polypeptide having the amino acid sequence
at positions from about 112 to about 890 in SEQ ID NO:4; a nucleotide sequence encoding
the polypeptide having the amino acid sequence encoded by the cDNA clone contained in
ATCC Deposit No. 209582; a nucleotide sequence encoding the mature METH2
polypeptide having the amino acid sequence encoded by the cDNA clone contained in
ATCC Deposit No. 209582; a nucleotide sequence encoding amino acids 214 to 439 in SEQ
ID NO:4 (the metalloprotease domain of METH2); a nucleotide sequence encoding amino
acids 440 to 529 in SEQ ID NO:4 (the disintegrin domain of METH2); a nucleotide
sequence encoding amino acids 530 to 583 in SEQ ID NO:4 (the first TSP-like domain of
METH2); a nucleotide sequence encoding amino acids 837 to 890 in SEQ ID NO:4 (the
second TSP-like domain of METH2); a nucleotide sequence encoding amino acids 280 to
606 in SEQ ID NO:4; a nucleotide sequence encoding amino acids 529 to 548 in SEQ ID
NO:4; or a nucleotide sequence complementary to any of the above nucleotide sequences.

By a polynucleotide having a nucleotide sequence at least, for example, 95%
"identical" to a reference nucleotide sequence encoding a METH1 or METH2 polypeptide
is intended that the nucleotide sequence of the polynucleotide is identical to the reference
sequence except that the polynucleotide sequence may include up to five point mutations per
each 100 nucleotides of the reference nucleotide sequence encoding the METH1 or METH2
polypeptide. In other words, to obtain a polynucleotide having a nucleotide sequence at least
95% identical to a reference nucleotide sequence, up to 5% of the nucleotides in the
reference sequence may be deleted or substituted with another nucleotide, or a number of
nucleotides up to 5% of the total nucleotides in the reference sequence may be inserted into
the reference sequence. These mutations of the reference sequence may occur at the 5' or
3' terminal positions of the reference nucleotide sequence or anywhere between those
terminal positions, interspersed either individually among nucleotides in the reference
sequence or in one or more contiguous groups within the reference sequence.
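
The arithmetic behind this definition can be sketched as follows. This is an illustrative calculation only, assuming whole-number percentages and counting each deleted, substituted or inserted nucleotide as one change; the function name max_point_mutations is hypothetical and is not part of any alignment program.

```python
def max_point_mutations(reference_length, percent_identity):
    """Largest number of changes that keeps a sequence at or above the
    stated percent identity to a reference of the given length."""
    if not 0 <= percent_identity <= 100:
        raise ValueError("percent_identity must be between 0 and 100")
    # Integer arithmetic avoids floating-point rounding at the boundary.
    return reference_length * (100 - percent_identity) // 100


if __name__ == "__main__":
    for pct in (95, 96, 97, 98, 99):
        print(pct, max_point_mutations(100, pct))
```

For a 100-nucleotide reference, 95% identity therefore allows up to five changes, matching the statement above.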

As a practical matter, whether any particular nucleic acid molecule is at least 95%,
96%, 97%, 98% or 99% identical to, for instance, the nucleotide sequence shown in SEQ
ID NO:1 or SEQ ID NO:3 or to the nucleotide sequence of the deposited cDNA clones can
be determined conventionally using known computer programs such as the Bestfit program
(Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group,
University Research Park, 575 Science Drive, Madison, WI 53711). Bestfit uses the local
homology algorithm of Smith and Waterman, Advances in Applied Mathematics 2:482-489
(1981), to find the best segment of homology between two sequences. When using Bestfit
or any other sequence alignment program to determine whether a particular sequence is, for
instance, 95% identical to a reference sequence according to the present invention, the
parameters are set, of course, such that the percentage of identity is calculated over the full
length of the reference nucleotide sequence and that gaps in homology of up to 5% of the
total number of nucleotides in the reference sequence are allowed.

A preferred method for determining the best overall match between a query sequence
(a sequence of the present invention) and a subject sequence, also referred to as a global
sequence alignment, uses the FASTDB computer program based on the
algorithm of Brutlag et al., Comp. Appl. Biosci. 6:237-245 (1990). In a sequence alignment,
the query and subject sequences are both DNA sequences. An RNA sequence can be
compared by converting U's to T's. The result of said global sequence alignment is in
percent identity. Preferred parameters used in a FASTDB alignment of DNA sequences to
calculate percent identity are: Matrix=Unitary, k-tuple=4, Mismatch Penalty=1, Joining
Penalty=30, Randomization Group Length=0, Cutoff Score=1, Gap Penalty=5, Gap Size
Penalty=0.05, Window Size=500 or the length of the subject nucleotide sequence, whichever
is shorter.
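
The parameter set and the two preprocessing rules stated above can be recorded as follows. This is a sketch for bookkeeping only: it does not invoke FASTDB, and the dictionary layout and the function names window_size and rna_to_dna are hypothetical.

```python
# Parameter values as recited above; the dictionary layout is illustrative only.
FASTDB_DNA_PARAMETERS = {
    "Matrix": "Unitary",
    "k-tuple": 4,
    "Mismatch Penalty": 1,
    "Joining Penalty": 30,
    "Randomization Group Length": 0,
    "Cutoff Score": 1,
    "Gap Penalty": 5,
    "Gap Size Penalty": 0.05,
}


def window_size(subject_length):
    """Window Size is 500 or the subject length, whichever is shorter."""
    return min(500, subject_length)


def rna_to_dna(sequence):
    """Convert U's to T's so an RNA sequence can be compared as DNA."""
    return sequence.upper().replace("U", "T")


if __name__ == "__main__":
    print(window_size(320), rna_to_dna("AUGGCU"))
```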

If the subject sequence is shorter than the query sequence because of 5' or 3' 
deletions, not because of internal deletions, a manual correction must be made to the results. 
This is because the FASTDB program does not account for 5' and 3' truncations of the 
subject sequence when calculating percent identity. For subject sequences truncated at the 
5' or 3' ends, relative to the query sequence, the percent identity is corrected by calculating 
the number of bases of the query sequence that are 5' and 3' of the subject sequence, which 
are not matched/aligned, as a percent of the total bases of the query sequence. Whether a 
nucleotide is matched/aligned is determined by the results of the FASTDB sequence 
alignment. This percentage is then subtracted from the percent identity, calculated by the 
above FASTDB program using the specified parameters, to arrive at a final percent identity 
score. This corrected score is what is used for the purposes of the present invention. Only 
bases outside the 5' and 3' bases of the subject sequence, as displayed by the FASTDB 
alignment, which are not matched/aligned with the query sequence are calculated for the 
purposes of manually adjusting the percent identity score. 

For example, a 90 base subject sequence is aligned to a 100 base query sequence to 
determine percent identity. The deletions occur at the 5' end of the subject sequence and, 
therefore, the FASTDB alignment does not show a match/alignment of the first 10 bases at
the 5' end. The 10 unpaired bases represent 10% of the sequence (number of bases at the 5'
and 3' ends not matched/total number of bases in the query sequence), so 10% is subtracted 
from the percent identity score calculated by the FASTDB program. If the remaining 90 
bases were perfectly matched the final percent identity would be 90%. In another example, 
a 90 base subject sequence is compared with a 100 base query sequence. This time the 
deletions are internal, so that there are no bases on the 5' or 3' ends of the subject sequence 
which are not matched/aligned with the query. In this case, the percent identity calculated 
by FASTDB is not manually corrected. Once again, only bases 5' and 3' of the subject 
sequence which are not matched/aligned with the query sequence are manually corrected for. 
No other manual corrections are to be made for the purposes of the present invention. 
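
The manual correction described above can be written out as a short calculation. This is a minimal sketch, assuming the FASTDB percent identity and the counts of unmatched 5' and 3' terminal query bases have already been read from the alignment; the function name corrected_percent_identity is hypothetical.

```python
def corrected_percent_identity(fastdb_percent, query_length,
                               unmatched_5prime=0, unmatched_3prime=0):
    """Subtract the share of query bases lying outside the subject's ends.

    Only query bases 5' and 3' of the subject that are not matched/aligned
    are counted; internal deletions are not corrected for.
    """
    if query_length <= 0:
        raise ValueError("query_length must be positive")
    unmatched = unmatched_5prime + unmatched_3prime
    return fastdb_percent - 100.0 * unmatched / query_length


if __name__ == "__main__":
    # First example: 10 unmatched 5' bases of a 100 base query, rest perfect.
    print(corrected_percent_identity(100.0, 100, unmatched_5prime=10))
    # Second example: internal deletions only, so nothing is subtracted.
    # The 90.0 here is a placeholder for whatever FASTDB reports.
    print(corrected_percent_identity(90.0, 100))
```

In the first example 10% is subtracted, giving 90%. In the second, the score reported by FASTDB is returned unchanged.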

The present application is directed to nucleic acid molecules at least 95%, 96%, 97%,
98% or 99% identical to the nucleic acid sequence shown in SEQ ID NO:1 or SEQ ID NO:3
or to the nucleic acid sequence of the deposited cDNAs, irrespective of whether they encode
a polypeptide having METH1 or METH2 activity. This is because even where a particular
nucleic acid molecule does not encode a polypeptide having METH1 or METH2 activity,
one of skill in the art would still know how to use the nucleic acid molecule, for instance,
as a hybridization probe or a polymerase chain reaction (PCR) primer. Uses of the nucleic
acid molecules of the present invention that do not encode a polypeptide having METH1 or
METH2 activity include, inter alia, (1) isolating the METH1 or METH2 gene or allelic
variants thereof in a cDNA library; (2) in situ hybridization (e.g., "FISH") to metaphase
chromosomal spreads to provide precise chromosomal location of the METH1 or METH2
gene, as described in Verma et al., Human Chromosomes: A Manual of Basic Techniques,
Pergamon Press, New York (1988); and (3) Northern Blot analysis for detecting METH1 or
METH2 mRNA expression in specific tissues.

Preferred, however, are nucleic acid molecules having sequences at least 95%, 96%,
97%, 98% or 99% identical to the nucleic acid sequence shown in SEQ ID NO:1 or SEQ ID
NO:3 or to a nucleic acid sequence of the deposited cDNAs which do, in fact, encode a
polypeptide having METH1 or METH2 protein activity. By "a polypeptide having METH1
activity" is intended polypeptides exhibiting METH1 activity in a particular biological assay.
For example, METH1 protein activity can be measured using the chorioallantoic membrane
assay (Iruela-Arispe et al., Thrombosis and Haemostasis 78(1):672-677 (1997)) or the
cornea pocket assay (Tolsma et al., J. Cell Biol. 122:497-511 (1993)), both described in
Example 4, below. By "a polypeptide having METH2 activity" is intended polypeptides
exhibiting METH2 activity in a particular biological assay. For example, METH2 protein
activity can also be measured using the chorioallantoic membrane assay (Iruela-Arispe et al.,
Thrombosis and Haemostasis 78(1):672-677 (1997)) or the cornea pocket assay (Tolsma et
al., J. Cell Biol. 122:497-511 (1993)), both described in Example 4, below.

Briefly, in the chorioallantoic assay, the potentially anti-angiogenic compound of 
interest is added to type I collagen pellets (Vitrogen), along with an angiogenic growth 
factor, such as bFGF. The samples are mixed and placed onto nylon meshes, and allowed 
to polymerize. After polymerization is complete, the meshes are placed onto the 
chorioallantoic membrane of 12 day old chick embryos and placed at 37 °C for 24 hours. 
The embryos are then injected with a fluorescent agent, such as FITC-dextran, and the 
meshes are fixed and mounted for observation under a fluorescent microscope. 

In the cornea pocket assay, hydron pellets containing the compound of interest and
an angiogenic growth factor, such as bFGF, are implanted 1 to 2 mm from the limbus of the
cornea of rats or mice. Response is examined after a period of time, for example 5 days.
The extent of angiogenesis is evaluated by measuring the capillaries migrating from the
limbus of the cornea.

Of course, due to the degeneracy of the genetic code, one of ordinary skill in the art
will immediately recognize that a large number of the nucleic acid molecules having a
sequence at least 95%, 96%, 97%, 98%, or 99% identical to a nucleic acid sequence of the
deposited cDNAs or a nucleic acid sequence shown in SEQ ID NO:1 or SEQ ID NO:3 will
encode a polypeptide "having METH1 or METH2 protein activity." In fact, since degenerate
variants of these nucleotide sequences all encode the same polypeptide, this will be clear to
the skilled artisan even without performing the above described comparison assay. It will
be further recognized in the art that, for such nucleic acid molecules that are not degenerate
variants, a reasonable number will also encode a polypeptide having METH1 or METH2
protein activity. This is because the skilled artisan is fully aware of amino acid substitutions
that are either less likely or not likely to significantly affect protein function (e.g., replacing
one aliphatic amino acid with a second aliphatic amino acid).
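
The point about degenerate variants can be illustrated with a short Python sketch. It is a minimal example under stated assumptions: it uses the standard genetic code, and the two 12-base sequences are invented for illustration and are not taken from SEQ ID NO:1 or SEQ ID NO:3. The helper names are hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA coding sequence ('*' marks a stop codon)."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def percent_identity(a, b):
    """Ungapped percent identity of two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


# Invented example sequences (not METH1 or METH2 sequences).
reference = "ATGGCTCTGAAA"
variant = "ATGGCCTTAAAG"  # differs only at synonymous codon positions

print(translate(reference), translate(variant))
print(percent_identity(reference, variant))
```

Both sequences translate to the same peptide even though their nucleotide identity is well below 100%, which is why no activity assay is needed to know that a degenerate variant encodes the same polypeptide.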

For example, guidance concerning how to make phenotypically silent amino acid
substitutions is provided in Bowie, J. U. et al., "Deciphering the Message in Protein
Sequences: Tolerance to Amino Acid Substitutions," Science 247:1306-1310 (1990),
wherein the authors indicate that proteins are surprisingly tolerant of amino acid
substitutions.

Vectors and Host Cells 

The present invention also relates to vectors which include the isolated DNA 
molecules of the present invention, host cells which are genetically engineered with the 
recombinant vectors, and the production of METH1 or METH2 polypeptides or fragments
thereof by recombinant techniques. Cell-free translation systems can also be employed to 
produce such proteins using RNAs derived from the DNA constructs of the present 
invention. 

The polynucleotides may be joined to a vector containing a selectable marker for 
propagation in a host. Generally, a plasmid vector is introduced in a precipitate, such as a 
calcium phosphate precipitate, or in a complex with a charged lipid. If the vector is a virus, 
it may be packaged in vitro using an appropriate packaging cell line and then transduced into 
host cells. 

The DNA insert should be operatively linked to an appropriate promoter, such as the
phage lambda PL promoter, the E. coli lac, trp and tac promoters, the SV40 early and late
promoters and promoters of retroviral LTRs, to name a few. Other suitable promoters will
be known to the skilled artisan. The expression constructs will further contain sites for
transcription initiation, termination and, in the transcribed region, a ribosome binding site
for translation. The coding portion of the mature transcripts expressed by the constructs will
preferably include a translation initiating codon at the beginning and a termination codon
(UAA, UGA or UAG) appropriately positioned at the end of the polypeptide to be translated.
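
As a minimal sketch of the last requirement, the following Python function checks that a coding region begins with an initiation codon and ends with one of the three termination codons named above. The function name is hypothetical, AUG is assumed as the initiation codon, and the example sequences are invented.

```python
STOP_CODONS = {"UAA", "UGA", "UAG"}


def check_coding_region(mrna):
    """Return a list of problems found; an empty list means all checks pass."""
    # Accept DNA-style input by converting T to U.
    mrna = mrna.upper().replace("T", "U")
    problems = []
    if len(mrna) % 3 != 0:
        problems.append("length is not a multiple of three")
    # Split into complete codons, ignoring any trailing partial codon.
    usable = len(mrna) - len(mrna) % 3
    codons = [mrna[i:i + 3] for i in range(0, usable, 3)]
    if not codons or codons[0] != "AUG":
        problems.append("does not begin with an AUG initiation codon")
    if not codons or codons[-1] not in STOP_CODONS:
        problems.append("does not end with UAA, UGA or UAG")
    if any(codon in STOP_CODONS for codon in codons[1:-1]):
        problems.append("contains a premature in-frame termination codon")
    return problems


print(check_coding_region("AUGGCUUAA"))     # well-formed toy coding region
print(check_coding_region("AUGUAAGCUUGA"))  # premature stop after the AUG
```

A check of this kind only confirms the reading frame boundaries; it says nothing about promoter linkage or the ribosome binding site, which are separate requirements in the paragraph above.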

As indicated, the expression vectors will preferably include at least one selectable 
marker. Such markers include dihydrofolate reductase or neomycin resistance for eukaryotic 
cell culture and tetracycline or ampicillin resistance genes for culturing in E. coli and other
bacteria. Representative examples of appropriate hosts include, but are not limited to, 
bacterial cells, such as E. coli, Streptomyces and Salmonella typhimurium cells; fungal cells, 
such as yeast cells; insect cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells 
such as CHO, COS and Bowes melanoma cells; and plant cells. Appropriate culture 
mediums and conditions for the above-described host cells are known in the art. 






Vectors preferred for use in bacteria include pQE70, pQE60 and pQE-9,
available from Qiagen; pBS vectors, Phagescript vectors, Bluescript vectors, pNH8A,
pNH16a, pNH18A, pNH46A, available from Stratagene; and ptrc99a, pKK223-3,
pKK233-3, pDR540, pRIT5 available from Pharmacia. Among preferred eukaryotic vectors
are pWLNEO, pSV2CAT, pOG44, pXT1 and pSG available from Stratagene; and pSVK3,
pBPV, pMSG and pSVL available from Pharmacia. Other suitable vectors will be readily
apparent to the skilled artisan.

In addition to the use of expression vectors in the practice of the present invention, 
the present invention further includes novel expression vectors comprising operator and 
promoter elements operatively linked to nucleotide sequences encoding a protein of interest. 
One example of such a vector is pHE4-5 which is described in detail below. 

As summarized in Figures 8 and 9, components of the pHE4-5 vector (SEQ ID
NO:12) include: 1) a neomycin phosphotransferase gene as a selection marker, 2) an E. coli
origin of replication, 3) a T5 phage promoter sequence, 4) two lac operator sequences, 5) a
Shine-Dalgarno sequence, and 6) the lactose operon repressor gene (lacIq). The origin of
replication (oriC) is derived from pUC19 (LTI, Gaithersburg, MD). The promoter sequence
and operator sequences were made synthetically. Synthetic production of nucleic acid
sequences is well known in the art. Clontech 95/96 Catalog, pages 215-216, Clontech,
1020 East Meadow Circle, Palo Alto, CA 94303. A nucleotide sequence encoding METH1
(SEQ ID NO:2) or METH2 (SEQ ID NO:4) is operatively linked to the promoter and
operator by inserting the nucleotide sequence between the NdeI and Asp718 sites of the
pHE4-5 vector.

As noted above, the pHE4-5 vector contains a lacIq gene. LacIq is an allele of the
lacI gene which confers tight regulation of the lac operator. Amann, E. et al., Gene
69:301-315 (1988); Stark, M., Gene 51:255-267 (1987). The lacIq gene encodes a repressor
protein which binds to lac operator sequences and blocks transcription of down-stream
(i.e., 3') sequences. However, the lacIq gene product dissociates from the lac operator in
the presence of either lactose or certain lactose analogs, e.g., isopropyl
β-D-thiogalactopyranoside (IPTG). METH1 or METH2 thus is not produced in appreciable
quantities in uninduced host cells containing the pHE4-5 vector. Induction of these host
cells by the addition of an agent such as IPTG, however, results in the expression of the
METH1 or METH2 coding sequence.

The promoter/operator sequences of the pHE4-5 vector (SEQ ID NO:13) comprise
a T5 phage promoter and two lac operator sequences. One operator is located 5' to the
transcriptional start site and the other is located 3' to the same site. These operators, when
present in combination with the lacIq gene product, confer tight repression of down-stream
sequences in the absence of a lac operon inducer, e.g., IPTG. Expression of operatively
linked sequences located down-stream from the lac operators may be induced by the
addition of a lac operon inducer, such as IPTG. Binding of a lac inducer to the lacIq
proteins results in their release from the lac operator sequences and the initiation of
transcription of operatively linked sequences. Lac operon regulation of gene expression is
reviewed in Devlin, T., Textbook of Biochemistry with Clinical Correlations, 4th
Edition (1997), pages 802-807.

The pHE4 series of vectors contain all of the components of the pHE4-5 vector
except for the METH1 or METH2 coding sequence. Features of the pHE4 vectors include
optimized synthetic T5 phage promoter, lac operator, and Shine-Dalgarno sequences.
Further, these sequences are also optimally spaced so that expression of an inserted gene
may be tightly regulated and a high level of expression occurs upon induction.

Known bacterial promoters suitable for use in the production of proteins of
the present invention include the E. coli lacI and lacZ promoters, the T3 and T7 promoters,
the gpt promoter, the lambda PR and PL promoters and the trp promoter. Suitable
eukaryotic promoters include the CMV immediate early promoter, the HSV thymidine
kinase promoter, the early and late SV40 promoters, the promoters of retroviral LTRs, such
as those of the Rous Sarcoma Virus (RSV), and metallothionein promoters, such as the
mouse metallothionein-I promoter.

The pHE4-5 vector also contains a Shine-Dalgarno sequence 5' to the AUG initiation
codon. Shine-Dalgarno sequences are short sequences generally located about 10
nucleotides up-stream (i.e., 5') from the AUG initiation codon. These sequences essentially
direct prokaryotic ribosomes to the AUG initiation codon.
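
The spacing relationship described above can be inspected with a short Python sketch. This is only an illustration under stated assumptions: the consensus motif AGGAGG and the 20-nucleotide search window are assumptions that do not come from this specification, the function name is hypothetical, and the example transcript is invented and is not the pHE4-5 sequence (SEQ ID NO:12).

```python
def shine_dalgarno_spacing(mrna, motif="AGGAGG", window=20):
    """Count the nucleotides between an upstream motif and the first AUG.

    Returns None if the transcript has no AUG, or if the motif does not
    occur within `window` nucleotides 5' of that AUG.
    """
    # Accept DNA-style input by converting T to U.
    mrna = mrna.upper().replace("T", "U")
    aug = mrna.find("AUG")
    if aug == -1:
        return None
    upstream_start = max(0, aug - window)
    # rfind picks the motif occurrence closest to the AUG within the window.
    hit = mrna.rfind(motif, upstream_start, aug)
    if hit == -1:
        return None
    return aug - (hit + len(motif))


# Invented toy transcript: leader, assumed motif, spacer, then the AUG.
toy = "GGAUCC" + "AGGAGG" + "UAAAUUA" + "AUGGCU"
print(shine_dalgarno_spacing(toy))  # 7
```

The sketch treats the first AUG in the string as the initiation codon, which is a simplification, and spacing conventions differ depending on whether the distance is measured from the start, center or end of the motif.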

Thus, the present invention is also directed to expression vectors useful for the 
production of the proteins of the present invention. This aspect of the invention is 
exemplified by the pHE4-5 vector (SEQ ID NO: 12). 

Introduction of the construct into the host cell can be effected by calcium phosphate
transfection, DEAE-dextran mediated transfection, cationic lipid-mediated transfection,
electroporation, transduction, infection or other methods. Such methods are described in
many standard laboratory manuals, such as Davis et al., Basic Methods In Molecular
Biology (1986).

The polypeptide may be expressed in a modified form, such as a fusion protein, and
may include not only secretion signals, but also additional heterologous functional regions.
For instance, a region of additional amino acids, particularly charged amino acids, may be
added to the N-terminus of the polypeptide to improve stability and persistence in the host
cell, during purification, or during subsequent handling and storage. Also, peptide moieties
may be added to the polypeptide to facilitate purification. Such regions may be removed
prior to final preparation of the polypeptide. The addition of peptide moieties to
polypeptides to engender secretion or excretion, to improve stability and to facilitate
purification, among others, are familiar and routine techniques in the art. A preferred fusion
protein comprises a heterologous region from immunoglobulin that is useful to solubilize
proteins. For example, EP-A-0 464 533 (Canadian counterpart 2045869) discloses fusion
proteins comprising various portions of constant region of immunoglobulin molecules
together with another human protein or part thereof. In many cases, the Fc part in a fusion
protein is thoroughly advantageous for use in therapy and diagnosis and thus results, for
example, in improved pharmacokinetic properties (EP-A 0232 262). On the other hand, for
some uses it would be desirable to be able to delete the Fc part after the fusion protein has
been expressed, detected and purified in the advantageous manner described. This is the case
when the Fc portion proves to be a hindrance to use in therapy and diagnosis, for example
when the fusion protein is to be used as an antigen for immunizations. In drug discovery,
for example, human proteins, such as the hIL5-receptor, have been fused with Fc portions
for the purpose of high-throughput screening assays to identify antagonists of hIL-5. See,
D. Bennett et al., J. Mol. Recognition 8:52-58 (1995) and K. Johanson et al., J. Biol.
Chem. 270(16):9459-9471 (1995).

The METH1 or METH2 protein can be recovered and purified from recombinant cell
cultures by well-known methods including ammonium sulfate or ethanol precipitation, acid
extraction, anion or cation exchange chromatography, phosphocellulose chromatography,
hydrophobic interaction chromatography, affinity chromatography, hydroxylapatite
chromatography and lectin chromatography. Most preferably, high performance liquid
chromatography ("HPLC") is employed for purification. Polypeptides of the present
invention include naturally purified products, products of chemical synthetic procedures, and
products produced by recombinant techniques from a prokaryotic or eukaryotic host,
including, for example, bacterial, yeast, higher plant, insect and mammalian cells.
Depending upon the host employed in a recombinant production procedure, the polypeptides
of the present invention may be glycosylated or may be non-glycosylated. In addition,
polypeptides of the invention may also include an initial modified methionine residue, in
some cases as a result of host-mediated processes.

In addition, polypeptides of the invention can be chemically synthesized using
techniques known in the art (e.g., see Creighton, 1983, Proteins: Structures and Molecular
Principles, W.H. Freeman & Co., N.Y., and Hunkapiller, M., et al., 1984, Nature
310:105-111). For example, a peptide corresponding to a fragment of the METH1 and/or
METH2 polypeptides of the invention can be synthesized by use of a peptide synthesizer.
Furthermore, if desired, nonclassical amino acids or chemical amino acid analogs can be
introduced as a substitution or addition into the METH1 and/or METH2 polypeptide
sequence. Non-classical amino acids include, but are not limited to, the D-isomers of the
common amino acids, 2,4-diaminobutyric acid, α-amino isobutyric acid, 4-aminobutyric
acid, Abu, 2-amino butyric acid, γ-Abu, ε-Ahx, 6-amino hexanoic acid, Aib, 2-amino
isobutyric acid, 3-amino propionic acid, ornithine, norleucine, norvaline, hydroxyproline,
sarcosine, citrulline, homocitrulline, cysteic acid, t-butylglycine, t-butylalanine,
phenylglycine, cyclohexylalanine, β-alanine, fluoro-amino acids, designer amino acids such
as β-methyl amino acids, Cα-methyl amino acids, Nα-methyl amino acids, and amino acid
analogs in general. Furthermore, the amino acid can be D (dextrorotary) or L (levorotary).

The invention encompasses METH1 and/or METH2 polypeptides which are
differentially modified during or after translation, e.g., by glycosylation, acetylation,
phosphorylation, amidation, derivatization by known protecting/blocking groups, proteolytic
cleavage, linkage to an antibody molecule or other cellular ligand, etc. Any of numerous
chemical modifications may be carried out by known techniques, including but not limited
to, specific chemical cleavage by cyanogen bromide, trypsin, chymotrypsin, papain, V8
protease, NaBH4; acetylation, formylation, oxidation, reduction; metabolic synthesis in the
presence of tunicamycin; etc.

Additional post-translational modifications encompassed by the invention include,
for example, N-linked or O-linked carbohydrate chains, processing of N-terminal or
C-terminal ends, attachment of chemical moieties to the amino acid backbone, chemical
modifications of N-linked or O-linked carbohydrate chains, and addition or deletion of an
N-terminal methionine residue as a result of prokaryotic host cell expression. The
polypeptides may also be modified with a detectable label, such as an enzymatic,
fluorescent, isotopic or affinity label to allow for detection and isolation of the protein.

Also provided by the invention are chemically modified derivatives of METH1
and/or METH2 which may provide additional advantages such as increased solubility,
stability and circulating time of the polypeptide, or decreased immunogenicity (see U.S.
Patent No. 4,179,337). The chemical moieties for derivatization may be selected from water
soluble polymers such as polyethylene glycol, ethylene glycol/propylene glycol copolymers,
carboxymethylcellulose, dextran, polyvinyl alcohol and the like. The polypeptides may be
modified at random positions within the molecule, or at predetermined positions within the
molecule and may include one, two, three or more attached chemical moieties.

The polymer may be of any molecular weight, and may be branched or unbranched.
For polyethylene glycol, the preferred molecular weight is between about 1 kDa and about
100 kDa (the term "about" indicating that in preparations of polyethylene glycol, some
molecules will weigh more, some less, than the stated molecular weight) for ease in handling
and manufacturing. Other sizes may be used, depending on the desired therapeutic profile
(e.g., the duration of sustained release desired, the effects, if any, on biological activity, the
ease in handling, the degree or lack of antigenicity and other known effects of the
polyethylene glycol to a therapeutic protein or analog).

The polyethylene glycol molecules (or other chemical moieties) should be attached
to the protein with consideration of effects on functional or antigenic domains of the protein.
There are a number of attachment methods available to those skilled in the art, e.g., EP 0 401
384, herein incorporated by reference (coupling PEG to G-CSF), see also Malik et al., Exp.
Hematol. 20:1028-1035 (1992) (reporting pegylation of GM-CSF using tresyl chloride). For
example, polyethylene glycol may be covalently bound through amino acid residues via a
reactive group, such as a free amino or carboxyl group. Reactive groups are those to which
an activated polyethylene glycol molecule may be bound. The amino acid residues having
a free amino group may include lysine residues and the N-terminal amino acid residues;
those having a free carboxyl group may include aspartic acid residues, glutamic acid residues
and the C-terminal amino acid residue. Sulfhydryl groups may also be used as a reactive
group for attaching the polyethylene glycol molecules. Preferred for therapeutic purposes
is attachment at an amino group, such as attachment at the N-terminus or lysine group.

One may specifically desire proteins chemically modified at the N-terminus. Using 
polyethylene glycol as an illustration of the present composition, one may select from a 
variety of polyethylene glycol molecules (by molecular weight, branching, etc.), the 
proportion of polyethylene glycol molecules to protein (or peptide) molecules in the reaction 
mix, the type of pegylation reaction to be performed, and the method of obtaining the 
selected N-terminally pegylated protein. The method of obtaining the N-terminally 
pegylated preparation (i.e., separating this moiety from other monopegylated moieties if 
necessary) may be by purification of the N-terminally pegylated material from a population 
of pegylated protein molecules. Selective chemical modification of proteins at the
N-terminus may be accomplished by reductive alkylation which exploits differential
reactivity of different types of primary amino groups (lysine versus the N-terminal) available 
for derivatization in a particular protein. Under the appropriate reaction conditions, 
substantially selective derivatization of the protein at the N-terminus with a carbonyl group 
containing polymer is achieved. 

The METH1 and/or METH2 polypeptides of the invention may be in the form of
monomers or multimers (i.e., dimers, trimers, tetramers and higher multimers). Accordingly,
the present invention relates to monomers and multimers of the METH1 and/or METH2
polypeptides of the invention, their preparation, and compositions (preferably,
pharmaceutical compositions) containing them. In specific embodiments, the polypeptides of
the invention are monomers, dimers, trimers or tetramers. In additional embodiments, the
multimers of the invention are at least dimers, at least trimers, or at least tetramers.

Multimers encompassed by the invention may be homomers or heteromers. As used
herein, the term homomer refers to a multimer containing only METH1 and/or METH2
polypeptides of the invention (including METH1 and/or METH2 fragments, variants, splice
variants, and fusion proteins, as described herein). These homomers may contain METH1
and/or METH2 polypeptides having identical or different amino acid sequences. In a
specific embodiment, a homomer of the invention is a multimer containing only METH1
and/or METH2 polypeptides having an identical amino acid sequence. In another specific
embodiment, a homomer of the invention is a multimer containing METH1 and/or METH2
polypeptides having different amino acid sequences. In specific embodiments, the multimer
of the invention is a homodimer (e.g., containing METH1 and/or METH2 polypeptides
having identical or different amino acid sequences) or a homotrimer (e.g., containing
METH1 and/or METH2 polypeptides having identical and/or different amino acid
sequences). In additional embodiments, the homomeric multimer of the invention is at least
a homodimer, at least a homotrimer, or at least a homotetramer.

As used herein, the term heteromer refers to a multimer containing one or more
heterologous polypeptides (i.e., polypeptides of different proteins) in addition to the METH1
and/or METH2 polypeptides of the invention. In a specific embodiment, the multimer of
the invention is a heterodimer, a heterotrimer, or a heterotetramer. In additional
embodiments, the heteromeric multimer of the invention is at least a heterodimer, at least a
heterotrimer, or at least a heterotetramer.

Multimers of the invention may be the result of hydrophobic, hydrophilic, ionic
and/or covalent associations and/or may be indirectly linked by, for example, liposome
formation. Thus, in one embodiment, multimers of the invention, such as, for example,
homodimers or homotrimers, are formed when polypeptides of the invention contact one
another in solution. In another embodiment, heteromultimers of the invention, such as, for
example, heterotrimers or heterotetramers, are formed when polypeptides of the invention
contact antibodies to the polypeptides of the invention (including antibodies to the
heterologous polypeptide sequence in a fusion protein of the invention) in solution. In other
embodiments, multimers of the invention are formed by covalent associations with and/or
between the METH1 and/or METH2 polypeptides of the invention. Such covalent
associations may involve one or more amino acid residues contained in the polypeptide
sequence (e.g., that recited in SEQ ID NO:2 or 4, or contained in the polypeptide encoded
by the clone HATCK89). In one instance, the covalent associations are cross-linking
between cysteine residues located within the polypeptide sequences which interact in the
native (i.e., naturally occurring) polypeptide. In another instance, the covalent associations
are the consequence of chemical or recombinant manipulation. Alternatively, such covalent
associations may involve one or more amino acid residues contained in the heterologous
polypeptide sequence in a METH1 and/or METH2 fusion protein. In one example, covalent
associations are between the heterologous sequence contained in a fusion protein of the
invention (see, e.g., US Patent Number 5,478,925). In a specific example, the covalent
associations are between the heterologous sequence contained in a METH1 and/or METH2-Fc
fusion protein of the invention (as described herein). In another specific example,
covalent associations of fusion proteins of the invention are between heterologous
polypeptide sequence from another Fibroblast Growth Factor family member that is capable
of forming covalently associated multimers, such as for example, osteoprotegerin (see, e.g.,
International Publication No. WO 98/49305, the contents of which are herein incorporated
by reference in their entirety).

The multimers of the invention may be generated using chemical techniques known
in the art. For example, polypeptides desired to be contained in the multimers of the
invention may be chemically cross-linked using linker molecules and linker molecule length
optimization techniques known in the art (see, e.g., US Patent Number 5,478,925, which is
herein incorporated by reference in its entirety). Additionally, multimers of the invention
may be generated using techniques known in the art to form one or more inter-molecule
cross-links between the cysteine residues located within the sequence of the polypeptides
desired to be contained in the multimer (see, e.g., US Patent Number 5,478,925, which is
herein incorporated by reference in its entirety). Further, polypeptides of the invention may
be routinely modified by the addition of cysteine or biotin to the C-terminus or N-terminus
of the polypeptide and techniques known in the art may be applied to generate multimers
containing one or more of these modified polypeptides (see, e.g., US Patent Number
5,478,925, which is herein incorporated by reference in its entirety). Additionally,
techniques known in the art may be applied to generate liposomes containing the polypeptide
components desired to be contained in the multimer of the invention (see, e.g., US Patent
Number 5,478,925, which is herein incorporated by reference in its entirety).

Alternatively, multimers of the invention may be generated using genetic engineering
techniques known in the art. In one embodiment, polypeptides contained in multimers of
the invention are produced recombinantly using fusion protein technology described herein
or otherwise known in the art (see, e.g., US Patent Number 5,478,925, which is herein
incorporated by reference in its entirety). In a specific embodiment, polynucleotides coding
for a homodimer of the invention are generated by ligating a polynucleotide sequence
encoding a polypeptide of the invention to a sequence encoding a linker polypeptide and
then further to a synthetic polynucleotide encoding the translated product of the polypeptide
in the reverse orientation from the original C-terminus to the N-terminus (lacking the leader
sequence) (see, e.g., US Patent Number 5,478,925, which is herein incorporated by reference
in its entirety). In another embodiment, recombinant techniques described herein or
otherwise known in the art are applied to generate recombinant polypeptides of the invention
which contain a transmembrane domain (or hydrophobic or signal peptide) and which can be
incorporated by membrane reconstitution techniques into liposomes (see, e.g., US Patent
Number 5,478,925, which is herein incorporated by reference in its entirety).

METH1 and METH2 Polypeptides and Fragments

The invention further provides an isolated METH1 polypeptide having the amino
acid sequence encoded by the deposited cDNA, or the amino acid sequence in SEQ ID
NO:2, or a peptide or polypeptide comprising a portion of the above polypeptides. The
invention also provides an isolated METH2 polypeptide having the amino acid sequence
encoded by the deposited cDNA, or the amino acid sequence in SEQ ID NO:4, or a peptide
or polypeptide comprising a portion of the above polypeptides.

Polypeptides of the present invention encompass not only full length polypeptides,
but also the mature, proprotein, processed forms of the protein, deletion mutants,
substitution variants, allelic variants, analogs, derivatives, etc.

METH1 or METH2 polypeptides can be composed of amino acids joined to each
other by peptide bonds or modified peptide bonds, i.e., peptide isosteres, and may contain
amino acids other than the 20 gene-encoded amino acids. The METH1 or METH2
polypeptides may be modified by either natural processes, such as posttranslational
processing, or by chemical modification techniques which are well known in the art. Such
modifications are well described in basic texts and in more detailed monographs, as well as
in a voluminous research literature. Modifications can occur anywhere in the METH1 or
METH2 polypeptide, including the peptide backbone, the amino acid side-chains and the
amino or carboxyl termini. It will be appreciated that the same type of modification may be
present in the same or varying degrees at several sites in a given METH1 or METH2
polypeptide. Also, a given METH1 or METH2 polypeptide may contain many types of
modifications. METH1 or METH2 polypeptides may be branched, for example, as a result
of ubiquitination, and they may be cyclic, with or without branching. Cyclic, branched, and
branched cyclic METH1 or METH2 polypeptides may result from posttranslational natural
processes or may be made by synthetic methods. Modifications include acetylation,
acylation, ADP-ribosylation, amidation, covalent attachment of flavin, covalent attachment
of a heme moiety, covalent attachment of a nucleotide or nucleotide derivative, covalent
attachment of a lipid or lipid derivative, covalent attachment of phosphatidylinositol,
cross-linking, cyclization, disulfide bond formation, demethylation, formation of covalent
cross-links, formation of cysteine, formation of pyroglutamate, formylation,
gamma-carboxylation, glycosylation, GPI anchor formation, hydroxylation, iodination,
methylation, myristoylation, oxidation, pegylation, proteolytic processing, phosphorylation,
prenylation, racemization, selenoylation, sulfation, transfer-RNA mediated addition of amino
acids to proteins such as arginylation, and ubiquitination. (See, for instance, PROTEINS -
STRUCTURE AND MOLECULAR PROPERTIES, 2nd Ed., T. E. Creighton, W. H. Freeman and
Company, New York (1993); POSTTRANSLATIONAL COVALENT MODIFICATION OF
PROTEINS, B. C. Johnson, Ed., Academic Press, New York, pgs. 1-12 (1983); Seifter et
al., Meth. Enzymol. 182:626-646 (1990); Rattan et al., Ann. N.Y. Acad. Sci. 663:48-62 (1992).)

It will be recognized in the art that some amino acid sequences of the METH1 and
METH2 polypeptides can be varied without significant effect on the structure or function of
the protein. If such differences in sequence are contemplated, it should be remembered that
there will be critical areas on the protein which determine activity.

The present inventors have shown that METH1 and METH2 inhibit angiogenesis in
vitro and in vivo. METH1 and METH2 each contain a metalloprotease domain, a disintegrin
domain, and TSP-like domains. The metalloprotease domain may be catalytically active.
The disintegrin domain may play a role in inhibiting angiogenesis by interacting with
integrins, since integrins are essential for the mediation of both proliferative and migratory
signals. The present inventors have shown that peptides derived from the TSP-like domains
of METH1 and METH2 inhibit angiogenesis in vitro and in vivo.
Thus, the invention further includes variations of the METH1 polypeptide which
show substantial METH1 polypeptide activity or which include regions of METH1 protein
such as the protein portions discussed below; and variations of the METH2 polypeptide
which show substantial METH2 polypeptide activity or which include regions of METH2
protein such as the protein portions discussed below. Such mutants include deletions,
insertions, inversions, repeats, and type substitutions. As indicated above, guidance
concerning which amino acid changes are likely to be phenotypically silent can be found in
Bowie, J.U., et al., "Deciphering the Message in Protein Sequences: Tolerance to Amino
Acid Substitutions," Science 247:1306-1310 (1990).

Thus, the fragment, derivative or analog of the polypeptide of SEQ ID NO:2 or SEQ
ID NO:4, or that encoded by the deposited cDNA, may be (i) one in which one or more of
the amino acid residues are substituted with a conserved or non-conserved amino acid
residue (preferably a conserved amino acid residue) and such substituted amino acid residue
may or may not be one encoded by the genetic code, or (ii) one in which one or more of the
amino acid residues includes a substituent group, or (iii) one in which the mature
polypeptide is fused with another compound, such as a compound to increase the half-life
of the polypeptide (for example, polyethylene glycol), or (iv) one in which the additional
amino acids are fused to the mature polypeptide, such as an IgG Fc fusion region peptide or
leader or secretory sequence or a sequence which is employed for purification of the mature
polypeptide or a proprotein sequence. Such fragments, derivatives and analogs are deemed
to be within the scope of those skilled in the art from the teachings herein.

Of particular interest are substitutions of charged amino acids with another charged
amino acid and with neutral or negatively charged amino acids. The latter results in proteins
with reduced positive charge to improve the characteristics of the METH1 or METH2
proteins. The prevention of aggregation is highly desirable. Aggregation of proteins not
only results in a loss of activity but can also be problematic when preparing pharmaceutical
formulations, because aggregates can be immunogenic. (Pinckard et al., Clin. Exp. Immunol.
2:331-340 (1967); Robbins et al., Diabetes 56:838-845 (1987); Cleland et al., Crit. Rev.
Therapeutic Drug Carrier Systems 10:301-371 (1993)).

As indicated, changes are preferably of a minor nature, such as conservative amino
acid substitutions that do not significantly affect the folding or activity of the protein (see
Table 3).

TABLE 3. Conservative Amino Acid Substitutions.

Aromatic        Phenylalanine
                Tryptophan
                Tyrosine

Hydrophobic     Leucine
                Isoleucine
                Valine

Polar           Glutamine
                Asparagine

Basic           Arginine
                Lysine
                Histidine

Acidic          Aspartic Acid
                Glutamic Acid

Small           Alanine
                Serine
                Threonine
                Methionine
                Glycine

Of course, the number of amino acid substitutions a skilled artisan would make
depends on many factors, including those described above. Generally speaking, the number
of amino acid substitutions for any given METH1 or METH2 polypeptide will not be more
than 50, 40, 30, 20, 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1.
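
By way of illustration only, the enumeration of conservative substitutions for a given sequence can be sketched programmatically. The sketch below is not part of the disclosed methods. It assumes the exchange groups that the residue-by-residue list in this section appears to apply: the hydrophobic and small residues of Table 3 are treated as a single group, and cysteine and proline are given no replacements. The names `EXCHANGE_GROUPS`, `conservative_substitutions` and `describe` are hypothetical, and the fragment "MGNAERA" is simply residues 1-7 as implied by that list (M1, G2, N3, A4, E5, R6, A7).

```python
# Illustrative sketch only; not part of the disclosed methods.
# Assumption: exchange groups inferred from the substitution list in the text.
# Hydrophobic and small residues of Table 3 are treated as one group there,
# and cysteine (C) and proline (P) have no listed replacements.
EXCHANGE_GROUPS = ["AGILSTMV", "FWY", "HKR", "DE", "NQ"]


def conservative_substitutions(sequence):
    """Return (residue+position, [replacements]) for each residue that
    belongs to an exchange group; positions are 1-based."""
    results = []
    for position, residue in enumerate(sequence, start=1):
        for group in EXCHANGE_GROUPS:
            if residue in group:
                replacements = [aa for aa in group if aa != residue]
                results.append((f"{residue}{position}", replacements))
                break
    return results


def describe(sequence):
    """Format the substitutions as 'X1 replaced with A, B, or C' clauses."""
    clauses = []
    for label, replacements in conservative_substitutions(sequence):
        if len(replacements) == 1:
            options = replacements[0]
        else:
            options = ", ".join(replacements[:-1]) + ", or " + replacements[-1]
        clauses.append(f"{label} replaced with {options}")
    return "; ".join(clauses)


if __name__ == "__main__":
    # Residues 1-7 as implied by the list in the text (hypothetical input).
    print(describe("MGNAERA"))
```

Residues outside every group are skipped, which is consistent with the gaps in the numbering of the list (for example, no entry is given for position 8).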

In particular, preferred METH1 molecules contain one or more of the following
conservative substitutions: M1 replaced with A, G, I, L, S, T, or V; G2 replaced with A, I,
L, S, T, M, or V; N3 replaced with Q; A4 replaced with G, I, L, S, T, M, or V; E5 replaced
with D; R6 replaced with H, or K; A7 replaced with G, I, L, S, T, M, or V; G9 replaced with
A, I, L, S, T, M, or V; S10 replaced with A, G, I, L, T, M, or V; R11 replaced with H, or K;
S12 replaced with A, G, I, L, T, M, or V; F13 replaced with W, or Y; G14 replaced with A,
I, L, S, T, M, or V; V16 replaced with A, G, I, L, S, T, or M; T18 replaced with A, G, I, L,
S, M, or V; L19 replaced with A, G, I, S, T, M, or V; L20 replaced with A, G, I, S, T, M, or
V; L21 replaced with A, G, I, S, T, M, or V; L22 replaced with A, G, I, S, T, M, or V; A23

replaced with G, I, L, S, T, M, or V; A24 replaced with G, I, L, S, T, M, or V; A25 replaced 
with G, I, L, S, T, M, or V; L26 replaced with A, G, I, S, T, M, or V; L27 replaced with A, 
G, I, S, T, M, or V; A28 replaced with G, I, L, S, T, M, or V; V29 replaced with A, G, I, L, 
S, T, or M; S30 replaced with A, G, I, L, T, M, or V; D31 replaced with E; A32 replaced
with G, I, L, S, T, M, or V; L33 replaced with A, G, I, S, T, M, or V; G34 replaced with A,
I, L, S, T, M, or V; R35 replaced with H, or K; S37 replaced with A, G, I, L, T, M, or V; E38 
replaced with D; E39 replaced with D; D40 replaced with E; E41 replaced with D; E42 
replaced with D; L43 replaced with A, G, I, S, T, M, or V; V44 replaced with A, G, I, L, S, 
T, or M; V45 replaced with A, G, I, L, S, T, or M; E47 replaced with D; L48 replaced with 
A, G, I, S, T, M, or V; E49 replaced with D; R50 replaced with H, or K; A51 replaced with 
G, I, L, S, T, M, or V; G53 replaced with A, I, L, S, T, M, or V; H54 replaced with K, or R; 
G55 replaced with A, I, L, S, T, M, or V; T56 replaced with A, G, I, L, S, M, or V; T57 
replaced with A, G, I, L, S, M, or V; R58 replaced with H, or K; L59 replaced with A, G, 
I, S, T, M, or V; R60 replaced with H, or K; L61 replaced with A, G, I, S, T, M, or V; H62 
replaced with K, or R; A63 replaced with G, I, L, S, T, M, or V; F64 replaced with W, or Y; 
D65 replaced with E; Q66 replaced with N; Q67 replaced with N; L68 replaced with A, G, 
I, S, T, M, or V; D69 replaced with E; L70 replaced with A, G, I, S, T, M, or V; E71 
replaced with D; L72 replaced with A, G, I, S, T, M, or V; R73 replaced with H, or K; D75 
replaced with E; S76 replaced with A, G, I, L, T, M, or V; S77 replaced with A, G, I, L, T, 
M, or V; F78 replaced with W, or Y; L79 replaced with A, G, I, S, T, M, or V; A80 replaced
with G, I, L, S, T, M, or V; G82 replaced with A, I, L, S, T, M, or V; F83 replaced with W, 
or Y; T84 replaced with A, G, I, L, S, M, or V; L85 replaced with A, G, I, S, T, M, or V; 
Q86 replaced with N; N87 replaced with Q; V88 replaced with A, G, I, L, S, T, or M; G89 
replaced with A, I, L, S, T, M, or V; R90 replaced with H, or K; K91 replaced with H, or R;
S92 replaced with A, G, I, L, T, M, or V; G93 replaced with A, I, L, S, T, M, or V; S94 
replaced with A, G, I, L, T, M, or V; E95 replaced with D; T96 replaced with A, G, I, L, S, 
M, or V; L98 replaced with A, G, I, S, T, M, or V; E100 replaced with D; T101 replaced
with A, G, I, L, S, M, or V; D102 replaced with E; L103 replaced with A, G, I, S, T, M, or 
V; A104 replaced with G, I, L, S, T, M, or V; H105 replaced with K, or R; F107 replaced 
with W, or Y; Y108 replaced with F, or W; S109 replaced with A, G, I, L, T, M, or V; G110
replaced with A, I, L, S, T, M, or V; T111 replaced with A, G, I, L, S, M, or V; V112
replaced with A, G, I, L, S, T, or M; N113 replaced with Q; G114 replaced with A, I, L, S,
T, M, or V; D115 replaced with E; S117 replaced with A, G, I, L, T, M, or V; S118 replaced
with A, G, I, L, T, M, or V; A119 replaced with G, I, L, S, T, M, or V; A120 replaced with
G, I, L, S, T, M, or V; A121 replaced with G, I, L, S, T, M, or V; L122 replaced with A, G,
I, S, T, M, or V; S123 replaced with A, G, I, L, T, M, or V; L124 replaced with A, G, I, S,
T, M, or V; E126 replaced with D; G127 replaced with A, I, L, S, T, M, or V; V128 replaced
with A, G, I, L, S, T, or M; R129 replaced with H, or K; G130 replaced with A, I, L, S, T, 
M, or V; A131 replaced with G, I, L, S, T, M, or V; F132 replaced with W, or Y; Y133 
replaced with F, or W; L134 replaced with A, G, I, S, T, M, or V; L135 replaced with A, G, 
I, S, T, M, or V; G136 replaced with A, I, L, S, T, M, or V; E137 replaced with D; A138 
replaced with G, I, L, S, T, M, or V; Y139 replaced with F, or W; F140 replaced with W, 
or Y; I141 replaced with A, G, L, S, T, M, or V; Q142 replaced with N; L144 replaced with
A, G, I, S, T, M, or V; A146 replaced with G, I, L, S, T, M, or V; A147 replaced with G, I, 
L, S, T, M, or V; S148 replaced with A, G, I, L, T, M, or V; E149 replaced with D; R150
replaced with H, or K; L151 replaced with A, G, I, S, T, M, or V; A152 replaced with G, I, 
L, S, T, M, or V; T153 replaced with A, G, I, L, S, M, or V; A154 replaced with G, I, L, S, 
T, M, or V; A155 replaced with G, I, L, S, T, M, or V; G157 replaced with A, I, L, S, T, M, 
or V; E158 replaced with D; K159 replaced with H, or R; A162 replaced with G, I, L, S, T, 
M, or V; L164 replaced with A, G, I, S, T, M, or V; Q165 replaced with N; F166 replaced 
with W, or Y; H167 replaced with K, or R; L168 replaced with A, G, I, S, T, M, or V; L169 
replaced with A, G, I, S, T, M, or V; R170 replaced with H, or K; R171 replaced with H, or
K; N172 replaced with Q; R173 replaced with H, or K; Q174 replaced with N; G175 
replaced with A, I, L, S, T, M, or V; D176 replaced with E; V177 replaced with A, G, I, L, 
S, T, or M; G178 replaced with A, I, L, S, T, M, or V; G179 replaced with A, I, L, S, T, M, 
or V; T180 replaced with A, G, I, L, S, M, or V; G182 replaced with A, I, L, S, T, M, or V; 
V183 replaced with A, G, I, L, S, T, or M; V184 replaced with A, G, I, L, S, T, or M; D185 
replaced with E; D186 replaced with E; E187 replaced with D; R189 replaced with H, or K;
T191 replaced with A, G, I, L, S, M, or V; G192 replaced with A, I, L, S, T, M, or V; K193 
replaced with H, or R; A194 replaced with G, I, L, S, T, M, or V; E195 replaced with D; 
T196 replaced with A, G, I, L, S, M, or V; E197 replaced with D; D198 replaced with E; 
E199 replaced with D; D200 replaced with E; E201 replaced with D; G202 replaced with 
A, I, L, S, T, M, or V; T203 replaced with A, G, I, L, S, M, or V; E204 replaced with D; 
G205 replaced with A, I, L, S, T, M, or V; E206 replaced with D; D207 replaced with E; 
E208 replaced with D; G209 replaced with A, I, L, S, T, M, or V; Q211 replaced with N;
W212 replaced with F, or Y; S213 replaced with A, G, I, L, T, M, or V; Q215 replaced with 
N; D216 replaced with E; A218 replaced with G, I, L, S, T, M, or V; L219 replaced with A, 
G, I, S, T, M, or V; Q220 replaced with N; G221 replaced with A, I, L, S, T, M, or V; V222
replaced with A, G, I, L, S, T, or M; G223 replaced with A, I, L, S, T, M, or V; Q224
replaced with N; T226 replaced with A, G, I, L, S, M, or V; G227 replaced with A, I, L, S, 
T, M, or V; T228 replaced with A, G, I, L, S, M, or V; G229 replaced with A, I, L, S, T, M, 
or V; S230 replaced with A, G, I, L, T, M, or V; I231 replaced with A, G, L, S, T, M, or V;
R232 replaced with H, or K; K233 replaced with H, or R; K234 replaced with H, or R; R235

replaced with H, or K; F236 replaced with W, or Y; V237 replaced with A, G, I, L, S, T, or 
M; S238 replaced with A, G, I, L, T, M, or V; S239 replaced with A, G, I, L, T, M, or V; 
H240 replaced with K, or R; R241 replaced with H, or K; Y242 replaced with F, or W; 
V243 replaced with A, G, I, L, S, T, or M; E244 replaced with D; T245 replaced with A, G, 
I, L, S, M, or V; M246 replaced with A, G, I, L, S, T, or V; L247 replaced with A, G, I, S,
T, M, or V; V248 replaced with A, G, I, L, S, T, or M; A249 replaced with G, I, L, S, T, M,
or V; D250 replaced with E; Q251 replaced with N; S252 replaced with A, G, I, L, T, M, or
V; M253 replaced with A, G, I, L, S, T, or V; A254 replaced with G, I, L, S, T, M, or V;
E255 replaced with D; F256 replaced with W, or Y; H257 replaced with K, or R; G258
replaced with A, I, L, S, T, M, or V; S259 replaced with A, G, I, L, T, M, or V; G260
replaced with A, I, L, S, T, M, or V; L261 replaced with A, G, I, S, T, M, or V; K262
replaced with H, or R; H263 replaced with K, or R; Y264 replaced with F, or W; L265
replaced with A, G, I, S, T, M, or V; L266 replaced with A, G, I, S, T, M, or V; T267
replaced with A, G, I, L, S, M, or V; L268 replaced with A, G, I, S, T, M, or V; F269
replaced with W, or Y; S270 replaced with A, G, I, L, T, M, or V; V271 replaced with A,

G, I, L, S, T, or M; A272 replaced with G, I, L, S, T, M, or V; A273 replaced with G, I, L, 
S, T, M, or V; R274 replaced with H, or K; L275 replaced with A, G, I, S, T, M, or V; Y276 
replaced with F, or W; K277 replaced with H, or R; H278 replaced with K, or R; S280 
replaced with A, G, I, L, T, M, or V; I281 replaced with A, G, L, S, T, M, or V; R282
replaced with H, or K; N283 replaced with Q; S284 replaced with A, G, I, L, T, M, or V;

V285 replaced with A, G, I, L, S, T, or M; S286 replaced with A, G, I, L, T, M, or V; L287 
replaced with A, G, I, S, T, M, or V; V288 replaced with A, G, I, L, S, T, or M; V289 
replaced with A, G, I, L, S, T, or M; V290 replaced with A, G, I, L, S, T, or M; K291 
replaced with H, or R; I292 replaced with A, G, L, S, T, M, or V; L293 replaced with A, G,
I, S, T, M, or V; V294 replaced with A, G, I, L, S, T, or M; I295 replaced with A, G, L, S,

T, M, or V; H296 replaced with K, or R; D297 replaced with E; E298 replaced with D; 
Q299 replaced with N; K300 replaced with H, or R; G301 replaced with A, I, L, S, T, M,
or V; E303 replaced with D; V304 replaced with A, G, I, L, S, T, or M; T305 replaced with
A, G, I, L, S, M, or V; S306 replaced with A, G, I, L, T, M, or V; N307 replaced with Q;
A308 replaced with G, I, L, S, T, M, or V; A309 replaced with G, I, L, S, T, M, or V; L310
replaced with A, G, I, S, T, M, or V; T311 replaced with A, G, I, L, S, M, or V; L312 
replaced with A, G, I, S, T, M, or V; R313 replaced with H, or K; N314 replaced with Q; 
F315 replaced with W, or Y; N317 replaced with Q; W318 replaced with F, or Y; Q319 
replaced with N; K320 replaced with H, or R; Q321 replaced with N; H322 replaced with 
K, or R; N323 replaced with Q; S326 replaced with A, G, I, L, T, M, or V; D327 replaced 
with E; R328 replaced with H, or K; D329 replaced with E; A330 replaced with G, I, L, S, 
T, M, or V; E331 replaced with D; H332 replaced with K, or R; Y333 replaced with F, or 
W; D334 replaced with E; T335 replaced with A, G, I, L, S, M, or V; A336 replaced with 
G, I, L, S, T, M, or V; I337 replaced with A, G, L, S, T, M, or V; L338 replaced with A, G,
I, S, T, M, or V; F339 replaced with W, or Y; T340 replaced with A, G, I, L, S, M, or V; 
R341 replaced with H, or K; Q342 replaced with N; D343 replaced with E; L344 replaced 
with A, G, I, S, T, M, or V; G346 replaced with A, I, L, S, T, M, or V; S347 replaced with 
A, G, I, L, T, M, or V; Q348 replaced with N; T349 replaced with A, G, I, L, S, M, or V; 
D351 replaced with E; T352 replaced with A, G, I, L, S, M, or V; L353 replaced with A, G,
I, S, T, M, or V; G354 replaced with A, I, L, S, T, M, or V; M355 replaced with A, G, I, L, 
S, T, or V; A356 replaced with G, I, L, S, T, M, or V; D357 replaced with E; V358 replaced 
with A, G, I, L, S, T, or M; G359 replaced with A, I, L, S, T, M, or V; T360 replaced with 
A, G, I, L, S, M, or V; V361 replaced with A, G, I, L, S, T, or M; D363 replaced with E; 
S365 replaced with A, G, I, L, T, M, or V; R366 replaced with H, or K; S367 replaced with 
A, G, I, L, T, M, or V; S369 replaced with A, G, I, L, T, M, or V; V370 replaced with A, G, 
I, L, S, T, or M; I371 replaced with A, G, L, S, T, M, or V; E372 replaced with D; D373
replaced with E; D374 replaced with E; G375 replaced with A, I, L, S, T, M, or V; L376 
replaced with A, G, I, S, T, M, or V; Q377 replaced with N; A378 replaced with G, I, L, S, 
T, M, or V; A379 replaced with G, I, L, S, T, M, or V; F380 replaced with W, or Y; T381 
replaced with A, G, I, L, S, M, or V; T382 replaced with A, G, I, L, S, M, or V; A383 
replaced with G, I, L, S, T, M, or V; H384 replaced with K, or R; E385 replaced with D; 
L386 replaced with A, G, I, S, T, M, or V; G387 replaced with A, I, L, S, T, M, or V; H388 
replaced with K, or R; V389 replaced with A, G, I, L, S, T, or M; F390 replaced with W, or 
Y; N391 replaced with Q; M392 replaced with A, G, I, L, S, T, or V; H394 replaced with
K, or R; D395 replaced with E; D396 replaced with E; A397 replaced with G, I, L, S, T, M,
or V; K398 replaced with H, or R; Q399 replaced with N; A401 replaced with G, I, L, S, T, 
M, or V; S402 replaced with A, G, I, L, T, M, or V; L403 replaced with A, G, I, S, T, M, or 
V; N404 replaced with Q; G405 replaced with A, I, L, S, T, M, or V; V406 replaced with 
A, G, I, L, S, T, or M; N407 replaced with Q; Q408 replaced with N; D409 replaced with
E; S410 replaced with A, G, I, L, T, M, or V; H411 replaced with K, or R; M412 replaced
with A, G, I, L, S, T, or V; M413 replaced with A, G, I, L, S, T, or V; A414 replaced with 
G, I, L, S, T, M, or V; S415 replaced with A, G, I, L, T, M, or V; M416 replaced with A, G, 
I, L, S, T, or V; L417 replaced with A, G, I, S, T, M, or V; S418 replaced with A, G, I, L, 
T, M, or V; N419 replaced with Q; L420 replaced with A, G, I, S, T, M, or V; D421 replaced
with E; H422 replaced with K, or R; S423 replaced with A, G, I, L, T, M, or V; Q424
replaced with N; W426 replaced with F, or Y; S427 replaced with A, G, I, L, T, M, or V;
S430 replaced with A, G, I, L, T, M, or V; A431 replaced with G, I, L, S, T, M, or V; Y432
replaced with F, or W; M433 replaced with A, G, I, L, S, T, or V; I434 replaced with A, G,
L, S, T, M, or V; T435 replaced with A, G, I, L, S, M, or V; S436 replaced with A, G, I, L,
T, M, or V; F437 replaced with W, or Y; L438 replaced with A, G, I, S, T, M, or V; D439
replaced with E; N440 replaced with Q; G441 replaced with A, I, L, S, T, M, or V; H442
replaced with K, or R; G443 replaced with A, I, L, S, T, M, or V; E444 replaced with D;
L446 replaced with A, G, I, S, T, M, or V; M447 replaced with A, G, I, L, S, T, or V; D448
replaced with E; K449 replaced with H, or R; Q451 replaced with N; N452 replaced with
Q; I454 replaced with A, G, L, S, T, M, or V; Q455 replaced with N; L456 replaced with A,
G, I, S, T, M, or V; G458 replaced with A, I, L, S, T, M, or V; D459 replaced with E; L460 
replaced with A, G, I, S, T, M, or V; G462 replaced with A, I, L, S, T, M, or V; T463 
replaced with A, G, I, L, S, M, or V; S464 replaced with A, G, I, L, T, M, or V; Y465 
replaced with F, or W; D466 replaced with E; A467 replaced with G, I, L, S, T, M, or V;

N468 replaced with Q; R469 replaced with H, or K; Q470 replaced with N; Q472 replaced 
with N; F473 replaced with W, or Y; T474 replaced with A, G, I, L, S, M, or V; F475 
replaced with W, or Y; G476 replaced with A, I, L, S, T, M, or V; E477 replaced with D; 
D478 replaced with E; S479 replaced with A, G, I, L, T, M, or V; K480 replaced with H, or 
R; H481 replaced with K, or R; D484 replaced with E; A485 replaced with G, I, L, S, T, M,

or V; A486 replaced with G, I, L, S, T, M, or V; S487 replaced with A, G, I, L, T, M, or V; 
T488 replaced with A, G, I, L, S, M, or V; S490 replaced with A, G, I, L, T, M, or V; T491
replaced with A, G, I, L, S, M, or V; L492 replaced with A, G, I, S, T, M, or V; W493
replaced with F, or Y; T495 replaced with A, G, I, L, S, M, or V; G496 replaced with A, I, 
L, S, T, M, or V; T497 replaced with A, G, I, L, S, M, or V; S498 replaced with A, G, I, L, 
T, M, or V; G499 replaced with A, I, L, S, T, M, or V; G500 replaced with A, I, L, S, T, M, 
or V; V501 replaced with A, G, I, L, S, T, or M; L502 replaced with A, G, I, S, T, M, or V;

V503 replaced with A, G, I, L, S, T, or M; Q505 replaced with N; T506 replaced with A, G, 
I, L, S, M, or V; K507 replaced with H, or R; H508 replaced with K, or R; F509 replaced 
with W, or Y; W511 replaced with F, or Y; A512 replaced with G, I, L, S, T, M, or V; D513
replaced with E; G514 replaced with A, I, L, S, T, M, or V; T515 replaced with A, G, I, L,
S, M, or V; S516 replaced with A, G, I, L, T, M, or V; G518 replaced with A, I, L, S, T, M,
or V; E519 replaced with D; G520 replaced with A, I, L, S, T, M, or V; K521 replaced with
H, or R; W522 replaced with F, or Y; I524 replaced with A, G, L, S, T, M, or V; N525
replaced with Q; G526 replaced with A, I, L, S, T, M, or V; K527 replaced with H, or R;
V529 replaced with A, G, I, L, S, T, or M; N530 replaced with Q; K531 replaced with H,
or R; T532 replaced with A, G, I, L, S, M, or V; D533 replaced with E; R534 replaced with
H, or K; K535 replaced with H, or R; H536 replaced with K, or R; F537 replaced with W,
or Y; D538 replaced with E; T539 replaced with A, G, I, L, S, M, or V; F541 replaced with
W, or Y; H542 replaced with K, or R; G543 replaced with A, I, L, S, T, M, or V; S544
replaced with A, G, I, L, T, M, or V; W545 replaced with F, or Y; G546 replaced with A,
I, L, S, T, M, or V; M547 replaced with A, G, I, L, S, T, or V; W548 replaced with F, or Y;
G549 replaced with A, I, L, S, T, M, or V; W551 replaced with F, or Y; G552 replaced with
A, I, L, S, T, M, or V; D553 replaced with E; S555 replaced with A, G, I, L, T, M, or V;
R556 replaced with H, or K; T557 replaced with A, G, I, L, S, M, or V; G559 replaced with
A, I, L, S, T, M, or V; G560 replaced with A, I, L, S, T, M, or V; G561 replaced with A, I,
L, S, T, M, or V; V562 replaced with A, G, I, L, S, T, or M; Q563 replaced with N; Y564

replaced with F, or W; T565 replaced with A, G, I, L, S, M, or V; M566 replaced with A, 
G, I, L, S, T, or V; R567 replaced with H, or K; E568 replaced with D; D570 replaced with 
E; N571 replaced with Q; V573 replaced with A, G, I, L, S, T, or M; K575 replaced with H, 
or R; N576 replaced with Q; G577 replaced with A, I, L, S, T, M, or V; G578 replaced with 
A, I, L, S, T, M, or V; K579 replaced with H, or R; Y580 replaced with F, or W; E582

replaced with D; G583 replaced with A, I, L, S, T, M, or V; K584 replaced with H, or R; 
R585 replaced with H, or K; V586 replaced with A, G, I, L, S, T, or M; R587 replaced with
H, or K; Y588 replaced with F, or W; R589 replaced with H, or K; S590 replaced with A,

G, I, L, T, M, or V; N592 replaced with Q; L593 replaced with A, G, I, S, T, M, or V; E594 
replaced with D; D595 replaced with E; D598 replaced with E; N599 replaced with Q; N600 
replaced with Q; G601 replaced with A, I, L, S, T, M, or V; K602 replaced with H, or R; 

T603 replaced with A, G, I, L, S, M, or V; F604 replaced with W, or Y; R605 replaced with
H, or K; E606 replaced with D; E607 replaced with D; Q608 replaced with N; E610 replaced
with D; A611 replaced with G, I, L, S, T, M, or V; H612 replaced with K, or R; N613 
replaced with Q; E614 replaced with D; F615 replaced with W, or Y; S616 replaced with 
A, G, I, L, T, M, or V; K617 replaced with H, or R; A618 replaced with G, I, L, S, T, M, or 

V; S619 replaced with A, G, I, L, T, M, or V; F620 replaced with W, or Y; G621 replaced

with A, I, L, S, T, M, or V; S622 replaced with A, G, I, L, T, M, or V; G623 replaced with 
A, I, L, S, T, M, or V; A625 replaced with G, I, L, S, T, M, or V; V626 replaced with A, G, 

I, L, S, T, or M; E627 replaced with D; W628 replaced with F, or Y; I629 replaced with A,
G, L, S, T, M, or V; K631 replaced with H, or R; Y632 replaced with F, or W; A633
replaced with G, I, L, S, T, M, or V; G634 replaced with A, I, L, S, T, M, or V; V635
replaced with A, G, I, L, S, T, or M; S636 replaced with A, G, I, L, T, M, or V; K638
replaced with H, or R; D639 replaced with E; R640 replaced with H, or K; K642 replaced
with H, or R; L643 replaced with A, G, I, S, T, M, or V; I644 replaced with A, G, L, S, T,
M, or V; Q646 replaced with N; A647 replaced with G, I, L, S, T, M, or V; K648 replaced
with H, or R; G649 replaced with A, I, L, S, T, M, or V; I650 replaced with A, G, L, S, T,

M, or V; G651 replaced with A, I, L, S, T, M, or V; Y652 replaced with F, or W; F653 
replaced with W, or Y; F654 replaced with W, or Y; V655 replaced with A, G, I, L, S, T, 
or M; L656 replaced with A, G, I, S, T, M, or V; Q657 replaced with N; K659 replaced with 

H, or R; V660 replaced with A, G, I, L, S, T, or M; V661 replaced with A, G, I, L, S, T, or 
M; D662 replaced with E; G663 replaced with A, I, L, S, T, M, or V; T664 replaced with

A, G, I, L, S, M, or V; S667 replaced with A, G, I, L, T, M, or V; D669 replaced with E; 
S670 replaced with A, G, I, L, T, M, or V; T671 replaced with A, G, I, L, S, M, or V; S672 
replaced with A, G, I, L, T, M, or V; V673 replaced with A, G, I, L, S, T, or M; V675 
replaced with A, G, I, L, S, T, or M; Q676 replaced with N; G677 replaced with A, I, L, S, 
T, M, or V; Q678 replaced with N; V680 replaced with A, G, I, L, S, T, or M; K681 replaced

with H, or R; A682 replaced with G, I, L, S, T, M, or V; G683 replaced with A, I, L, S, T, 
M, or V; D685 replaced with E; R686 replaced with H, or K; I687 replaced with A, G, L,
S, T, M, or V; I688 replaced with A, G, L, S, T, M, or V; D689 replaced with E; S690
replaced with A, G, I, L, T, M, or V; K691 replaced with H, or R; K692 replaced with H, or 
R; K693 replaced with H, or R; F694 replaced with W, or Y; D695 replaced with E; K696 
replaced with H, or R; G698 replaced with A, I, L, S, T, M, or V; V699 replaced with A, G, 
I, L, S, T, or M; G701 replaced with A, I, L, S, T, M, or V; G702 replaced with A, I, L, S,
T, M, or V; N703 replaced with Q; G704 replaced with A, I, L, S, T, M, or V; S705 replaced
with A, G, I, L, T, M, or V; T706 replaced with A, G, I, L, S, M, or V; K708 replaced with
H, or R; K709 replaced with H, or R; I710 replaced with A, G, L, S, T, M, or V; S711
replaced with A, G, I, L, T, M, or V; G712 replaced with A, I, L, S, T, M, or V; S713 

replaced with A, G, I, L, T, M, or V; V714 replaced with A, G, I, L, S, T, or M; T715
replaced with A, G, I, L, S, M, or V; S716 replaced with A, G, I, L, T, M, or V; A717
replaced with G, I, L, S, T, M, or V; K718 replaced with H, or R; G720 replaced with A, I,
L, S, T, M, or V; Y721 replaced with F, or W; H722 replaced with K, or R; D723 replaced
with E; I724 replaced with A, G, L, S, T, M, or V; I725 replaced with A, G, L, S, T, M, or
V; T726 replaced with A, G, I, L, S, M, or V; I727 replaced with A, G, L, S, T, M, or V;
T729 replaced with A, G, I, L, S, M, or V; G730 replaced with A, I, L, S, T, M, or V; A731
replaced with G, I, L, S, T, M, or V; T732 replaced with A, G, I, L, S, M, or V; N733
replaced with Q; I734 replaced with A, G, L, S, T, M, or V; E735 replaced with D; V736
replaced with A, G, I, L, S, T, or M; K737 replaced with H, or R; Q738 replaced with N;
R739 replaced with H, or K; N740 replaced with Q; Q741 replaced with N; R742 replaced

with H, or K; G743 replaced with A, I, L, S, T, M, or V; S744 replaced with A, G, I, L, T, 
M, or V; R745 replaced with H, or K; N746 replaced with Q; N747 replaced with Q; G748 
replaced with A, I, L, S, T, M, or V; S749 replaced with A, G, I, L, T, M, or V; F750 
replaced with W, or Y; L751 replaced with A, G, I, S, T, M, or V; A752 replaced with G, 
I, L, S, T, M, or V; I753 replaced with A, G, L, S, T, M, or V; K754 replaced with H, or R;

A755 replaced with G, I, L, S, T, M, or V; A756 replaced with G, I, L, S, T, M, or V; D757 
replaced with E; G758 replaced with A, I, L, S, T, M, or V; T759 replaced with A, G, I, L, 
S, M, or V; Y760 replaced with F, or W; I761 replaced with A, G, L, S, T, M, or V; L762
replaced with A, G, I, S, T, M, or V; N763 replaced with Q; G764 replaced with A, I, L, S,
T, M, or V; D765 replaced with E; Y766 replaced with F, or W; T767 replaced with A, G,

I, L, S, M, or V; L768 replaced with A, G, I, S, T, M, or V; S769 replaced with A, G, I, L, 
T, M, or V; T770 replaced with A, G, I, L, S, M, or V; L771 replaced with A, G, I, S, T, M,
or V; E772 replaced with D; Q773 replaced with N; D774 replaced with E; I775 replaced
with A, G, L, S, T, M, or V; M776 replaced with A, G, I, L, S, T, or V; Y777 replaced with 

F, or W; K778 replaced with H, or R; G779 replaced with A, I, L, S, T, M, or V; V780 
replaced with A, G, I, L, S, T, or M; V781 replaced with A, G, I, L, S, T, or M; L782 
replaced with A, G, I, S, T, M, or V; R783 replaced with H, or K; Y784 replaced with F, or 
W; S785 replaced with A, G, I, L, T, M, or V; G786 replaced with A, I, L, S, T, M, or V; 
S787 replaced with A, G, I, L, T, M, or V; S788 replaced with A, G, I, L, T, M, or V; A789 
replaced with G, I, L, S, T, M, or V; A790 replaced with G, I, L, S, T, M, or V; L791
replaced with A, G, I, S, T, M, or V; E792 replaced with D; R793 replaced with H, or K;
I794 replaced with A, G, L, S, T, M, or V; R795 replaced with H, or K; S796 replaced with
A, G, I, L, T, M, or V; F797 replaced with W, or Y; S798 replaced with A, G, I, L, T, M, or 
V; L800 replaced with A, G, I, S, T, M, or V; K801 replaced with H, or R; E802 replaced 
with D; L804 replaced with A, G, I, S, T, M, or V; T805 replaced with A, G, I, L, S, M, or 
V; 1806 replaced with A, G, L, S, T, M, or V; Q807 replaced with N; V808 replaced with 
A, G, I, L, S, T, or M; L809 replaced with A, G, I, S, T, M, or V; T810 replaced with A, G, 
I, L, S, M, or V; V811 replaced with A, G, I, L, S, T, or M; G812 replaced with A, I, L, S,
T, M, or V; N813 replaced with Q; A814 replaced with G, I, L, S, T, M, or V; L815 replaced
with A, G, I, S, T, M, or V; R816 replaced with H, or K; K818 replaced with H, or R; I819
replaced with A, G, L, S, T, M, or V; K820 replaced with H, or R; Y821 replaced with F, 
or W; T822 replaced with A, G, I, L, S, M, or V; Y823 replaced with F, or W; F824 replaced 
with W, or Y; V825 replaced with A, G, I, L, S, T, or M; K826 replaced with H, or R; K827 
replaced with H, or R; K828 replaced with H, or R; K829 replaced with H, or R; E830 
replaced with D; S831 replaced with A, G, I, L, T, M, or V; F832 replaced with W, or Y; 
N833 replaced with Q; A834 replaced with G, I, L, S, T, M, or V; I835 replaced with A, G,
L, S, T, M, or V; T837 replaced with A, G, I, L, S, M, or V; F838 replaced with W, or Y; 
S839 replaced with A, G, I, L, T, M, or V; A840 replaced with G, I, L, S, T, M, or V; W841 
replaced with F, or Y; V842 replaced with A, G, I, L, S, T, or M; I843 replaced with A, G,
L, S, T, M, or V; E844 replaced with D; E845 replaced with D; W846 replaced with F, or 
Y; G847 replaced with A, I, L, S, T, M, or V; E848 replaced with D; S850 replaced with A,
G, I, L, T, M, or V; K851 replaced with H, or R; S852 replaced with A, G, I, L, T, M, or V;
E854 replaced with D; L855 replaced with A, G, I, S, T, M, or V; G856 replaced with A, I,
L, S, T, M, or V; W857 replaced with F, or Y; Q858 replaced with N; R859 replaced with
H, or K; R860 replaced with H, or K; L861 replaced with A, G, I, S, T, M, or V; V862
replaced with A, G, I, L, S, T, or M; E863 replaced with D; R865 replaced with H, or K; 
D866 replaced with E; I867 replaced with A, G, L, S, T, M, or V; N868 replaced with Q;
G869 replaced with A, I, L, S, T, M, or V; Q870 replaced with N; A872 replaced with G,
I, L, S, T, M, or V; S873 replaced with A, G, I, L, T, M, or V; E874 replaced with D; A876
replaced with G, I, L, S, T, M, or V; K877 replaced with H, or R; E878 replaced with D;
V879 replaced with A, G, I, L, S, T, or M; K880 replaced with H, or R; A882 replaced with 
G, I, L, S, T, M, or V; S883 replaced with A, G, I, L, T, M, or V; T884 replaced with A, G,
I, L, S, M, or V; R885 replaced with H, or K; A888 replaced with G, I, L, S, T, M, or V;
D889 replaced with E; H890 replaced with K, or R; Q894 replaced with N; W895 replaced
with F, or Y; Q896 replaced with N; L897 replaced with A, G, I, S, T, M, or V; G898
replaced with A, I, L, S, T, M, or V; E899 replaced with D; W900 replaced with F, or Y;
S901 replaced with A, G, I, L, T, M, or V; S902 replaced with A, G, I, L, T, M, or V; S904
replaced with A, G, I, L, T, M, or V; K905 replaced with H, or R; T906 replaced with A, G,
I, L, S, M, or V; G908 replaced with A, I, L, S, T, M, or V; K909 replaced with H, or R;
G910 replaced with A, I, L, S, T, M, or V; Y911 replaced with F, or W; K912 replaced with
H, or R; K913 replaced with H, or R; R914 replaced with H, or K; S915 replaced with A,
G, I, L, T, M, or V; L916 replaced with A, G, I, S, T, M, or V; K917 replaced with H, or R;
L919 replaced with A, G, I, S, T, M, or V; S920 replaced with A, G, I, L, T, M, or V; H921
replaced with K, or R; D922 replaced with E; G923 replaced with A, I, L, S, T, M, or V;
G924 replaced with A, I, L, S, T, M, or V; V925 replaced with A, G, I, L, S, T, or M; L926
replaced with A, G, I, S, T, M, or V; S927 replaced with A, G, I, L, T, M, or V; H928 
replaced with K, or R; E929 replaced with D; S930 replaced with A, G, I, L, T, M, or V; 
D932 replaced with E; L934 replaced with A, G, I, S, T, M, or V; K935 replaced with H, or
R; K936 replaced with H, or R; K938 replaced with H, or R; H939 replaced with K, or R;
F940 replaced with W, or Y; I941 replaced with A, G, L, S, T, M, or V; D942 replaced with
E; F943 replaced with W, or Y; T945 replaced with A, G, I, L, S, M, or V; M946 replaced 
with A, G, I, L, S, T, or V; A947 replaced with G, I, L, S, T, M, or V; E948 replaced with 
D; S950 replaced with A, G, I, L, T, M, or V. 
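
The substitution pattern used in these listings can be summarized compactly. The following is a minimal sketch only, inferred from the listings themselves and not part of the original disclosure: it assumes five residue classes (the class labels are hypothetical names chosen for illustration), treats a conservative substitution as any other member of the same class, and treats a non-conservative substitution as any residue outside that class. Cysteine and proline belong to no class here, which matches their absence from the conservative listing.

```python
# Residue classes inferred from the substitution listings.
# The class labels are hypothetical; they do not appear in the source text.
CLASSES = {
    "aliphatic_small": "AGILSTMV",
    "acidic": "DE",
    "amide": "NQ",
    "basic": "HKR",
    "aromatic": "FWY",
}

# The twenty standard amino acids, one-letter codes.
ALL_RESIDUES = set("ACDEFGHIKLMNPQRSTVWY")


def residue_class(res):
    """Return the set of residues sharing a class with `res`.

    C and P are in no class above, so each forms a class of its own.
    """
    for members in CLASSES.values():
        if res in members:
            return set(members)
    return {res}


def conservative(res):
    """Replacements listed as conservative: same class, excluding `res`."""
    return residue_class(res) - {res}


def non_conservative(res):
    """Replacements listed as non-conservative: everything outside the class."""
    return ALL_RESIDUES - residue_class(res)
```

Under these assumptions, `conservative("A")` yields G, I, L, S, T, M, and V, and `non_conservative("M")` yields D, E, H, K, R, N, Q, F, W, Y, P, and C, which agrees with the entries for alanine and for M1 in the listings.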

Also preferred are METH1 polypeptides with one or more of the following non-conservative
substitutions: M1 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G2
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; N3 replaced with D, E, H, K, R, A, G,
I, L, S, T, M, V, F, W, Y, P, or C; A4 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C;
E5 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; R6 replaced with 
D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; A7 replaced with D, E, H, K, R, N, Q, 
F, W, Y, P, or C; P8 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or
C; G9 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; S10 replaced with D, E, H, K,
R, N, Q, F, W, Y, P, or C; R11 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y,
P, or C; S12 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; F13 replaced with D, E, 
H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; G14 replaced with D, E, H, K, R, N, Q, F, W, 
Y, P, or C; P15 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; 
V16 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; P17 replaced with D, E, H, K, R,
A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; T18 replaced with D, E, H, K, R, N, Q, F, W,
Y, P, or C; L19 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L20 replaced with D,
E, H, K, R, N, Q, F, W, Y, P, or C; L21 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or
C; L22 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A23 replaced with D, E, H, K,
R, N, Q, F, W, Y, P, or C; A24 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A25
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L26 replaced with D, E, H, K, R, N, Q,
F, W, Y, P, or C; L27 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A28 replaced
with D, E, H, K, R, N, Q, F, W, Y, P, or C; V29 replaced with D, E, H, K, R, N, Q, F, W,
Y, P, or C; S30 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; D31 replaced with H,
K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; A32 replaced with D, E, H, K, R, N,
Q, F, W, Y, P, or C; L33 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G34 replaced 
with D, E, H, K, R, N, Q, F, W, Y, P, or C; R35 replaced with D, E, A, G, I, L, S, T, M, V, 
N, Q, F, W, Y, P, or C; P36 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W,
Y, or C; S37 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; E38 replaced with H, K,
R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; E39 replaced with H, K, R, A, G, I, L, S,
T, M, V, N, Q, F, W, Y, P, or C; D40 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, 
F, W, Y, P, or C; E41 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; 
E42 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; L43 replaced with 
D, E, H, K, R, N, Q, F, W, Y, P, or C; V44 replaced with D, E, H, K, R, N, Q, F, W, Y, P, 
or C; V45 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; P46 replaced with D, E, H,
K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; E47 replaced with H, K, R, A, G, I, L, S, 
T, M, V, N, Q, F, W, Y, P, or C; L48 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C;
E49 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; R50 replaced with
D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; A51 replaced with D, E, H, K, R, N, Q,
F, W, Y, P, or C; P52 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or
C; G53 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; H54 replaced with D, E, A, G,
I, L, S, T, M, V, N, Q, F, W, Y, P, or C; G55 replaced with D, E, H, K, R, N, Q, F, W, Y,
P, or C; T56 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; T57 replaced with D, E, 
H, K, R, N, Q, F, W, Y, P, or C; R58 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, 
Y, P, or C; L59 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; R60 replaced with D,
E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; L61 replaced with D, E, H, K, R, N, Q, F,
W, Y, P, or C; H62 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; A63
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; F64 replaced with D, E, H, K, R, N, Q,
A, G, I, L, S, T, M, V, P, or C; D65 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F,
W, Y, P, or C; Q66 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C;
Q67 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; L68 replaced with
D, E, H, K, R, N, Q, F, W, Y, P, or C; D69 replaced with H, K, R, A, G, I, L, S, T, M, V,
N, Q, F, W, Y, P, or C; L70 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; E71
replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; L72 replaced with D,
E, H, K, R, N, Q, F, W, Y, P, or C; R73 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F,
W, Y, P, or C; P74 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C;
D75 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; S76 replaced with
D, E, H, K, R, N, Q, F, W, Y, P, or C; S77 replaced with D, E, H, K, R, N, Q, F, W, Y, P, 
or C; F78 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; L79 replaced 
with D, E, H, K, R, N, Q, F, W, Y, P, or C; A80 replaced with D, E, H, K, R, N, Q, F, W, 
Y, P, or C; P81 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; 
G82 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; F83 replaced with D, E, H, K, R,
N, Q, A, G, I, L, S, T, M, V, P, or C; T84 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or 
C; L85 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; Q86 replaced with D, E, H, K, 
R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; N87 replaced with D, E, H, K, R, A, G, I, L, S, 
T, M, V, F, W, Y, P, or C; V88 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G89 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; R90 replaced with D, E, A, G, I, L, S,
T, M, V, N, Q, F, W, Y, P, or C; K91 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W,
Y, P, or C; S92 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G93 replaced with D,
E, H, K, R, N, Q, F, W, Y, P, or C; S94 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or
C; E95 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; T96 replaced 
with D, E, H, K, R, N, Q, F, W, Y, P, or C; P97 replaced with D, E, H, K, R, A, G, I, L, S, 
T, M, V, N, Q, F, W, Y, or C; L98 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; P99 
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; E100 replaced with
H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; T101 replaced with D, E, H, K, R,
N, Q, F, W, Y, P, or C; D102 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, 
P, or C; L103 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A104 replaced with D, 
E, H, K, R, N, Q, F, W, Y, P, or C; H105 replaced with D, E, A, G, I, L, S, T, M, V, N, Q,
F, W, Y, P, or C; C106 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y,
or P; F107 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; Y108 replaced
with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; S109 replaced with D, E, H, K, R,
N, Q, F, W, Y, P, or C; G110 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; T111
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; V112 replaced with D, E, H, K, R, N,
Q, F, W, Y, P, or C; N113 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P,
or C; G114 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; D115 replaced with H, K,
R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; P116 replaced with D, E, H, K, R, A, G,
I, L, S, T, M, V, N, Q, F, W, Y, or C; S117 replaced with D, E, H, K, R, N, Q, F, W, Y, P,
or C; S118 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A119 replaced with D, E,
H, K, R, N, Q, F, W, Y, P, or C; A120 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C;
A121 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L122 replaced with D, E, H, K,
R, N, Q, F, W, Y, P, or C; S123 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L124
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; C125 replaced with D, E, H, K, R, A,
G, I, L, S, T, M, V, N, Q, F, W, Y, or P; E126 replaced with H, K, R, A, G, I, L, S, T, M, V,
N, Q, F, W, Y, P, or C; G127 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; V128
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; R129 replaced with D, E, A, G, I, L, 
S, T, M, V, N, Q, F, W, Y, P, or C; G130 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or 
C; A131 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; F132 replaced with D, E, H, 
K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; Y133 replaced with D, E, H, K, R, N, Q, A, G,
I, L, S, T, M, V, P, or C; L134 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L135
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G136 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; E137 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P,
or C; A138 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; Y139 replaced with D, E,
H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; F140 replaced with D, E, H, K, R, N, Q, A,
G, I, L, S, T, M, V, P, or C; I141 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; Q142
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; P143 replaced with D,
E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; L144 replaced with D, E, H, K, R,
N, Q, F, W, Y, P, or C; P145 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F,
W, Y, or C; A146 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A147 replaced with
D, E, H, K, R, N, Q, F, W, Y, P, or C; S148 replaced with D, E, H, K, R, N, Q, F, W, Y, P,
or C; E149 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; R150
replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; L151 replaced with D,
E, H, K, R, N, Q, F, W, Y, P, or C; A152 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or
C; T153 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A154 replaced with D, E, H,
K, R, N, Q, F, W, Y, P, or C; A155 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C;
P156 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; G157
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; E158 replaced with H, K, R, A, G, I,
L, S, T, M, V, N, Q, F, W, Y, P, or C; K159 replaced with D, E, A, G, I, L, S, T, M, V, N,
Q, F, W, Y, P, or C; P160 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, 
Y, or C; P161 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; A162
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; P163 replaced with D, E, H, K, R, A,
G, I, L, S, T, M, V, N, Q, F, W, Y, or C; L164 replaced with D, E, H, K, R, N, Q, F, W, Y,
P, or C; Q165 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; F166
replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; H167 replaced with D,
E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; L168 replaced with D, E, H, K, R, N, Q,
F, W, Y, P, or C; L169 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; R170 replaced
with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; R171 replaced with D, E, A, G,
I, L, S, T, M, V, N, Q, F, W, Y, P, or C; N172 replaced with D, E, H, K, R, A, G, I, L, S, T,
M, V, F, W, Y, P, or C; R173 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P,
or C; Q174 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; G175
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; D176 replaced with H, K, R, A, G, I,
L, S, T, M, V, N, Q, F, W, Y, P, or C; V177 replaced with D, E, H, K, R, N, Q, F, W, Y, P,
or C; G178 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G179 replaced with D, E,
H, K, R, N, Q, F, W, Y, P, or C; T180 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C;
C181 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; G182
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; V183 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; V184 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; D185 
replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; D186 replaced with
H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; E187 replaced with H, K, R, A, G,
I, L, S, T, M, V, N, Q, F, W, Y, P, or C; P188 replaced with D, E, H, K, R, A, G, I, L, S, T, 
M, V, N, Q, F, W, Y, or C; R189 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, 
P, or C; P190 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; T191 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G192 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; K193 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or
C; A194 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; E195 replaced with H, K, R,
A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; T196 replaced with D, E, H, K, R, N, Q, F,
W, Y, P, or C; E197 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; 
D198 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; E199 replaced 
with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; D200 replaced with H, K, R, 
A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; E201 replaced with H, K, R, A, G, I, L, S, T, 
M, V, N, Q, F, W, Y, P, or C; G202 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; 
T203 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; E204 replaced with H, K, R, A,
G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; G205 replaced with D, E, H, K, R, N, Q, F, W, 
Y, P, or C; E206 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; D207 
replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; E208 replaced with
H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; G209 replaced with D, E, H, K, R, 
N, Q, F, W, Y, P, or C; P210 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, 
W, Y, or C; Q211 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C;
W212 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; S213 replaced with 
D, E, H, K, R, N, Q, F, W, Y, P, or C; P214 replaced with D, E, H, K, R, A, G, I, L, S, T, 
M, V, N, Q, F, W, Y, or C; Q215 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, 
Y, P, or C; D216 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; P217 
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; A218 replaced with 
D, E, H, K, R, N, Q, F, W, Y, P, or C; L219 replaced with D, E, H, K, R, N, Q, F, W, Y, P, 
or C; Q220 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; G221 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; V222 replaced with D, E, H, K, R, N,
Q, F, W, Y, P, or C; G223 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; Q224
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; P225 replaced with D, 
E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; T226 replaced with D, E, H, K, R, 
N, Q, F, W, Y, P, or C; G227 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; T228 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G229 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; S230 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; I231 replaced
with D, E, H, K, R, N, Q, F, W, Y, P, or C; R232 replaced with D, E, A, G, I, L, S, T, M, V, 
N, Q, F, W, Y, P, or C; K233 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, 
or C; K234 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; R235 replaced 
with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; F236 replaced with D, E, H, K, R, 
N, Q, A, G, I, L, S, T, M, V, P, or C; V237 replaced with D, E, H, K, R, N, Q, F, W, Y, P, 
or C; S238 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; S239 replaced with D, E, 
H, K, R, N, Q, F, W, Y, P, or C; H240 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, 
W, Y, P, or C; R241 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; 
Y242 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; V243 replaced with 
D, E, H, K, R, N, Q, F, W, Y, P, or C; E244 replaced with H, K, R, A, G, I, L, S, T, M, V, 
N, Q, F, W, Y, P, or C; T245 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; M246 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L247 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; V248 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A249 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; D250 replaced with H, K, R, A, G, I, 
L, S, T, M, V, N, Q, F, W, Y, P, or C; Q251 replaced with D, E, H, K, R, A, G, I, L, S, T, 
M, V, F, W, Y, P, or C; S252 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; M253 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A254 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; E255 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, 
or C; F256 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; H257 replaced 
with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; G258 replaced with D, E, H, K, 
R, N, Q, F, W, Y, P, or C; S259 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G260 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L261 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; K262 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or 
C; H263 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; Y264 replaced 
with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; L265 replaced with D, E, H, K, R, 
N, Q, F, W, Y, P, or C; L266 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; T267
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L268 replaced with D, E, H, K, R, N,
Q, F, W, Y, P, or C; F269 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or 
C; S270 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; V271 replaced with D, E, H, 
K, R, N, Q, F, W, Y, P, or C; A272 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; 
A273 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; R274 replaced with D, E, A, G, 
I, L, S, T, M, V, N, Q, F, W, Y, P, or C; L275 replaced with D, E, H, K, R, N, Q, F, W, Y, 
P, or C; Y276 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; K277 
replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; H278 replaced with D,
E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; P279 replaced with D, E, H, K, R, A, G, 
I, L, S, T, M, V, N, Q, F, W, Y, or C; S280 replaced with D, E, H, K, R, N, Q, F, W, Y, P, 
or C; I281 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; R282 replaced with D, E,
A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; N283 replaced with D, E, H, K, R, A, G, I, 
L, S, T, M, V, F, W, Y, P, or C; S284 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; 
V285 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; S286 replaced with D, E, H, K, 
R, N, Q, F, W, Y, P, or C; L287 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; V288 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; V289 replaced with D, E, H, K, R, N,
Q, F, W, Y, P, or C; V290 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; K291 
replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; I292 replaced with D, E,
H, K, R, N, Q, F, W, Y, P, or C; L293 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C;
V294 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; I295 replaced with D, E, H, K,
R, N, Q, F, W, Y, P, or C; H296 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, 
P, or C; D297 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; E298 
replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; Q299 replaced with 
D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; K300 replaced with D, E, A, G, I, 
L, S, T, M, V, N, Q, F, W, Y, P, or C; G301 replaced with D, E, H, K, R, N, Q, F, W, Y, P, 
or C; P302 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; E303 
replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; V304 replaced with 
D, E, H, K, R, N, Q, F, W, Y, P, or C; T305 replaced with D, E, H, K, R, N, Q, F, W, Y, P, 
or C; S306 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; N307 replaced with D, E, 
H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; A308 replaced with D, E, H, K, R, N, Q, 
F, W, Y, P, or C; A309 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L310 replaced 
with D, E, H, K, R, N, Q, F, W, Y, P, or C; T311 replaced with D, E, H, K, R, N, Q, F, W,
Y, P, or C; L312 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; R313 replaced with
D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; N314 replaced with D, E, H, K, R, A,
G, I, L, S, T, M, V, F, W, Y, P, or C; F315 replaced with D, E, H, K, R, N, Q, A, G, I, L, S,
T, M, V, P, or C; C316 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y,
or P; N317 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; W318 
replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; Q319 replaced with D,
E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; K320 replaced with D, E, A, G, I, L, S,
T, M, V, N, Q, F, W, Y, P, or C; Q321 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V,
F, W, Y, P, or C; H322 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; 
N323 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; P324 replaced 
with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; P325 replaced with D, E,
H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; S326 replaced with D, E, H, K, R, N,
Q, F, W, Y, P, or C; D327 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P,
or C; R328 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; D329 replaced 
with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; A330 replaced with D, E, H, 
K, R, N, Q, F, W, Y, P, or C; E331 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, 
W, Y, P, or C; H332 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; 
Y333 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; D334 replaced with 
H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; T335 replaced with D, E, H, K, R, 
N, Q, F, W, Y, P, or C; A336 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; I337
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L338 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; F339 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or 
C; T340 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; R341 replaced with D, E, A,
G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; Q342 replaced with D, E, H, K, R, A, G, I, L, 
S, T, M, V, F, W, Y, P, or C; D343 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, 
W, Y, P, or C; L344 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; C345 replaced 
with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; G346 replaced with D, E,
H, K, R, N, Q, F, W, Y, P, or C; S347 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; 
Q348 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; T349 replaced 
with D, E, H, K, R, N, Q, F, W, Y, P, or C; C350 replaced with D, E, H, K, R, A, G, I, L, 
S, T, M, V, N, Q, F, W, Y, or P; D351 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, 
F, W, Y, P, or C; T352 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L353 replaced
with D, E, H, K, R, N, Q, F, W, Y, P, or C; G354 replaced with D, E, H, K, R, N, Q, F, W,
Y, P, or C; M355 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A356 replaced with 
D, E, H, K, R, N, Q, F, W, Y, P, or C; D357 replaced with H, K, R, A, G, I, L, S, T, M, V, 
N, Q, F, W, Y, P, or C; V358 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G359 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; T360 replaced with D, E, H, K, R, N,
Q, F, W, Y, P, or C; V361 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; C362 
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; D363 replaced with 
H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; P364 replaced with D, E, H, K, R, 
A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; S365 replaced with D, E, H, K, R, N, Q, F, W,
Y, P, or C; R366 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; S367
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; C368 replaced with D, E, H, K, R, A,
G, I, L, S, T, M, V, N, Q, F, W, Y, or P; S369 replaced with D, E, H, K, R, N, Q, F, W, Y,
P, or C; V370 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; I371 replaced with D,
E, H, K, R, N, Q, F, W, Y, P, or C; E372 replaced with H, K, R, A, G, I, L, S, T, M, V, N,
Q, F, W, Y, P, or C; D373 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P,
or C; D374 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; G375
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L376 replaced with D, E, H, K, R, N,
Q, F, W, Y, P, or C; Q377 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P,
or C; A378 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A379 replaced with D, E,
H, K, R, N, Q, F, W, Y, P, or C; F380 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T,
M, V, P, or C; T381 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; T382 replaced with 
D, E, H, K, R, N, Q, F, W, Y, P, or C; A383 replaced with D, E, H, K, R, N, Q, F, W, Y, P, 
or C; H384 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; E385 replaced 
with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; L386 replaced with D, E, H,
K, R, N, Q, F, W, Y, P, or C; G387 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C;
H388 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; V389 replaced with 
D, E, H, K, R, N, Q, F, W, Y, P, or C; F390 replaced with D, E, H, K, R, N, Q, A, G, I, L, 
S, T, M, V, P, or C; N391 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, 
or C; M392 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; P393 replaced with D, E,
H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; H394 replaced with D, E, A, G, I, L,
S, T, M, V, N, Q, F, W, Y, P, or C; D395 replaced with H, K, R, A, G, I, L, S, T, M, V, N, 
Q, F, W, Y, P, or C; D396 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P,
or C; A397 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; K398 replaced with D, E,
A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; Q399 replaced with D, E, H, K, R, A, G, I, 
L, S, T, M, V, F, W, Y, P, or C; C400 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, 
N, Q, F, W, Y, or P; A401 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; S402 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L403 replaced with D, E, H, K, R, N,
Q, F, W, Y, P, or C; N404 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, 
or C; G405 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; V406 replaced with D, E, 
H, K, R, N, Q, F, W, Y, P, or C; N407 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V,
F, W, Y, P, or C; Q408 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or 
C; D409 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; S410
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; H411 replaced with D, E, A, G, I, L,
S, T, M, V, N, Q, F, W, Y, P, or C; M412 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or
C; M413 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A414 replaced with D, E, H,
K, R, N, Q, F, W, Y, P, or C; S415 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C;
M416 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L417 replaced with D, E, H, K,
R, N, Q, F, W, Y, P, or C; S418 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; N419
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; L420 replaced with D,
E, H, K, R, N, Q, F, W, Y, P, or C; D421 replaced with H, K, R, A, G, I, L, S, T, M, V, N,
Q, F, W, Y, P, or C; H422 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or
C; S423 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; Q424 replaced with D, E, H,
K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; P425 replaced with D, E, H, K, R, A, G, I, L, 
S, T, M, V, N, Q, F, W, Y, or C; W426 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, 
M, V, P, or C; S427 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; P428 replaced with 
D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; C429 replaced with D, E, H, K, 
R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; S430 replaced with D, E, H, K, R, N, Q, F,
W, Y, P, or C; A431 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; Y432 replaced 
with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; M433 replaced with D, E, H, K, 
R, N, Q, F, W, Y, P, or C; I434 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; T435
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; S436 replaced with D, E, H, K, R, N,
Q, F, W, Y, P, or C; F437 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or
C; L438 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; D439 replaced with H, K, R, 
A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; N440 replaced with D, E, H, K, R, A, G, I,
L, S, T, M, V, F, W, Y, P, or C; G441 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C;
H442 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; G443 replaced with 
D, E, H, K, R, N, Q, F, W, Y, P, or C; E444 replaced with H, K, R, A, G, I, L, S, T, M, V, 
N, Q, F, W, Y, P, or C; C445 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, 
W, Y, or P; L446 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; M447 replaced with
D, E, H, K, R, N, Q, F, W, Y, P, or C; D448 replaced with H, K, R, A, G, I, L, S, T, M, V, 
N, Q, F, W, Y, P, or C; K449 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, 
or C; P450 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; Q451
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; N452 replaced with
D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; P453 replaced with D, E, H, K, R,
A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; I454 replaced with D, E, H, K, R, N, Q, F, W,
Y, P, or C; Q455 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; L456
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; P457 replaced with D, E, H, K, R, A,
G, I, L, S, T, M, V, N, Q, F, W, Y, or C; G458 replaced with D, E, H, K, R, N, Q, F, W, Y,
P, or C; D459 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; L460
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; P461 replaced with D, E, H, K, R, A,
G, I, L, S, T, M, V, N, Q, F, W, Y, or C; G462 replaced with D, E, H, K, R, N, Q, F, W, Y,
P, or C; T463 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; S464 replaced with D,
E, H, K, R, N, Q, F, W, Y, P, or C; Y465 replaced with D, E, H, K, R, N, Q, A, G, I, L, S,
T, M, V, P, or C; D466 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or
C; A467 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; N468 replaced with D, E, H, 
K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; R469 replaced with D, E, A, G, I, L, S, T, M, 
V, N, Q, F, W, Y, P, or C; Q470 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, 
Y, P, or C; C471 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P;
Q472 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; F473 replaced
with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; T474 replaced with D, E, H, K, R, 
N, Q, F, W, Y, P, or C; F475 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, 
or C; G476 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; E477 replaced with H, K, 
R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; D478 replaced with H, K, R, A, G, I, L,
S, T, M, V, N, Q, F, W, Y, P, or C; S479 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or
C; K480 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; H481 replaced 
with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; C482 replaced with D, E, H, K,
R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; P483 replaced with D, E, H, K, R, A, G, I, L,
S, T, M, V, N, Q, F, W, Y, or C; D484 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, 
F, W, Y, P, or C; A485 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A486 replaced 
with D, E, H, K, R, N, Q, F, W, Y, P, or C; S487 replaced with D, E, H, K, R, N, Q, F, W, 
Y, P, or C; T488 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; C489 replaced with
D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; S490 replaced with D, E, H, K,
R, N, Q, F, W, Y, P, or C; T491 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L492
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; W493 replaced with D, E, H, K, R, N, 
Q, A, G, I, L, S, T, M, V, P, or C; C494 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V,
N, Q, F, W, Y, or P; T495 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G496
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; T497 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; S498 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G499 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G500 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; V501 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L502
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; V503 replaced with D, E, H, K, R, N,
Q, F, W, Y, P, or C; C504 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, 
Y, or P; Q505 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; T506 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; K507 replaced with D, E, A, G, I, L, 
S, T, M, V, N, Q, F, W, Y, P, or C; H508 replaced with D, E, A, G, I, L, S, T, M, V, N, Q,
F, W, Y, P, or C; F509 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C;
P510 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; W511 
replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; A512 replaced with D,
E, H, K, R, N, Q, F, W, Y, P, or C; D513 replaced with H, K, R, A, G, I, L, S, T, M, V, N,
Q, F, W, Y, P, or C; G514 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; T515
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; S516 replaced with D, E, H, K, R, N,
Q, F, W, Y, P, or C; C517 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, 
Y, or P; G518 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; E519 replaced with H, 
K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; G520 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; K521 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or
C; W522 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; C523 replaced
with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; I524 replaced with D, E, H,
K, R, N, Q, F, W, Y, P, or C; N525 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F,
W, Y, P, or C; G526 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; K527 replaced
with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; C528 replaced with D, E, H, K, 
R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; V529 replaced with D, E, H, K, R, N, Q, F, 
W, Y, P, or C; N530 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; 
K531 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; T532 replaced with
D, E, H, K, R, N, Q, F, W, Y, P, or C; D533 replaced with H, K, R, A, G, I, L, S, T, M, V, 
N, Q, F, W, Y, P, or C; R534 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, 
or C; K535 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; H536 
replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; F537 replaced with D, E, 
H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; D538 replaced with H, K, R, A, G, I, L, S, T, 
M, V, N, Q, F, W, Y, P, or C; T539 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; 
P540 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; F541 replaced 
with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; H542 replaced with D, E, A, G, I, 
L, S, T, M, V, N, Q, F, W, Y, P, or C; G543 replaced with D, E, H, K, R, N, Q, F, W, Y, P, 
or C; S544 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; W545 replaced with D, E, 
H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; G546 replaced with D, E, H, K, R, N, Q, F, 
W, Y, P, or C; M547 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; W548 replaced 
with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; G549 replaced with D, E, H, K, R, 
N, Q, F, W, Y, P, or C; P550 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, 
W, Y, or C; W551 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; G552 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; D553 replaced with H, K, R, A, G, I, 
L, S, T, M, V, N, Q, F, W, Y, P, or C; C554 replaced with D, E, H, K, R, A, G, I, L, S, T, 
M, V, N, Q, F, W, Y, or P; S555 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; R556 
replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; T557 replaced with D,
E, H, K, R, N, Q, F, W, Y, P, or C; C558 replaced with D, E, H, K, R, A, G, I, L, S, T, M, 
V, N, Q, F, W, Y, or P; G559 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G560 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G561 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; V562 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; Q563 
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; Y564 replaced with 
D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; T565 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; M566 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; R567 
replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; E568 replaced with H,
K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; C569 replaced with D, E, H, K, R, A,
G, I, L, S, T, M, V, N, Q, F, W, Y, or P; D570 replaced with H, K, R, A, G, I, L, S, T, M, 
V, N, Q, F, W, Y, P, or C; N571 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, 
Y, P, or C; P572 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; 
V573 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; P574 replaced with D, E, H, K,
R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; K575 replaced with D, E, A, G, I, L, S, T, M, 
V, N, Q, F, W, Y, P, or C; N576 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, 
Y, P, or C; G577 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G578 replaced with 
D, E, H, K, R, N, Q, F, W, Y, P, or C; K579 replaced with D, E, A, G, I, L, S, T, M, V, N,
Q, F, W, Y, P, or C; Y580 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or
C; C581 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; E582 
replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; G583 replaced with 
D, E, H, K, R, N, Q, F, W, Y, P, or C; K584 replaced with D, E, A, G, I, L, S, T, M, V, N, 
Q, F, W, Y, P, or C; R585 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or
C; V586 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; R587 replaced with D, E, A,
G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; Y588 replaced with D, E, H, K, R, N, Q, A, G, 
I, L, S, T, M, V, P, or C; R589 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, 
or C; S590 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; C591 replaced with D, E,
H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; N592 replaced with D, E, H, K, R, A,
G, I, L, S, T, M, V, F, W, Y, P, or C; L593 replaced with D, E, H, K, R, N, Q, F, W, Y, P,
or C; E594 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; D595 
replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; C596 replaced with 
D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; P597 replaced with D, E, H, K, 
R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; D598 replaced with H, K, R, A, G, I, L, S, 
T, M, V, N, Q, F, W, Y, P, or C; N599 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V,
F, W, Y, P, or C; N600 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or 
C; G601 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; K602 replaced with D, E, A,
G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; T603 replaced with D, E, H, K, R, N, Q, F, W,
Y, P, or C; F604 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; R605
replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; E606 replaced with H,
K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; E607 replaced with H, K, R, A, G, I, 
L, S, T, M, V, N, Q, F, W, Y, P, or C; Q608 replaced with D, E, H, K, R, A, G, I, L, S, T,
M, V, F, W, Y, P, or C; C609 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F,
W, Y, or P; E610 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C;
A611 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; H612 replaced with D, E, A, G,
I, L, S, T, M, V, N, Q, F, W, Y, P, or C; N613 replaced with D, E, H, K, R, A, G, I, L, S, T,
M, V, F, W, Y, P, or C; E614 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y,
P, or C; F615 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; S616 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; K617 replaced with D, E, A, G, I, L, 
S, T, M, V, N, Q, F, W, Y, P, or C; A618 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or 
C; S619 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; F620 replaced with D, E, H,
K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; G621 replaced with D, E, H, K, R, N, Q, F, W,
Y, P, or C; S622 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G623 replaced with 
D, E, H, K, R, N, Q, F, W, Y, P, or C; P624 replaced with D, E, H, K, R, A, G, I, L, S, T, 
M, V, N, Q, F, W, Y, or C; A625 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; V626 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; E627 replaced with H, K, R, A, G, I,
L, S, T, M, V, N, Q, F, W, Y, P, or C; W628 replaced with D, E, H, K, R, N, Q, A, G, I, L,
S, T, M, V, P, or C; I629 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; P630 replaced
with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; K631 replaced with D, E, 
A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; Y632 replaced with D, E, H, K, R, N, Q, A, 
G, I, L, S, T, M, V, P, or C; A633 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G634
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; V635 replaced with D, E, H, K, R, N,
Q, F, W, Y, P, or C; S636 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; P637 
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; K638 replaced with 
D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; D639 replaced with H, K, R, A, G, I, 
L, S, T, M, V, N, Q, F, W, Y, P, or C; R640 replaced with D, E, A, G, I, L, S, T, M, V, N,
Q, F, W, Y, P, or C; C641 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W,
Y, or P; K642 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; L643
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; I644 replaced with D, E, H, K, R, N,
Q, F, W, Y, P, or C; C645 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, 
Y, or P; Q646 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; A647
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; K648 replaced with D, E, A, G, I, L,
S, T, M, V, N, Q, F, W, Y, P, or C; G649 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or
C; I650 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G651 replaced with D, E, H,
K, R, N, Q, F, W, Y, P, or C; Y652 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M,
V, P, or C; F653 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; F654 
replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; V655 replaced with D, 
E, H, K, R, N, Q, F, W, Y, P, or C; L656 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or 
C; Q657 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; P658 replaced
with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; K659 replaced with D, E, 
A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; V660 replaced with D, E, H, K, R, N, Q, F, 
W, Y, P, or C; V661 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; D662 replaced 
with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; G663 replaced with D, E, H,
K, R, N, Q, F, W, Y, P, or C; T664 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C;
P665 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; C666 
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; S667 replaced with 
D, E, H, K, R, N, Q, F, W, Y, P, or C; P668 replaced with D, E, H, K, R, A, G, I, L, S, T, 
M, V, N, Q, F, W, Y, or C; D669 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W,
Y, P, or C; S670 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; T671 replaced with
D, E, H, K, R, N, Q, F, W, Y, P, or C; S672 replaced with D, E, H, K, R, N, Q, F, W, Y, P, 
or C; V673 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; C674 replaced with D, E, 
H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; V675 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; Q676 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P,
or C; G677 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; Q678 replaced with D, E,
H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; C679 replaced with D, E, H, K, R, A, G,
I, L, S, T, M, V, N, Q, F, W, Y, or P; V680 replaced with D, E, H, K, R, N, Q, F, W, Y, P, 
or C; K681 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; A682 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G683 replaced with D, E, H, K, R, N,
Q, F, W, Y, P, or C; C684 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W,
Y, or P; D685 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; R686
replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; I687 replaced with D, E,
H, K, R, N, Q, F, W, Y, P, or C; I688 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C;
D689 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; S690 replaced
with D, E, H, K, R, N, Q, F, W, Y, P, or C; K691 replaced with D, E, A, G, I, L, S, T, M,
V, N, Q, F, W, Y, P, or C; K692 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, 
P, or C; K693 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; F694
replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; D695 replaced with H,
K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; K696 replaced with D, E, A, G, I, L, 
S, T, M, V, N, Q, F, W, Y, P, or C; C697 replaced with D, E, H, K, R, A, G, I, L, S, T, M, 
V, N, Q, F, W, Y, or P; G698 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; V699 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; C700 replaced with D, E, H, K, R, A,
G, I, L, S, T, M, V, N, Q, F, W, Y, or P; G701 replaced with D, E, H, K, R, N, Q, F, W, Y, 
P, or C; G702 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; N703 replaced with D, 
E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; G704 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; S705 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; T706
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; C707 replaced with D, E, H, K, R, A,
G, I, L, S, T, M, V, N, Q, F, W, Y, or P; K708 replaced with D, E, A, G, I, L, S, T, M, V,
N, Q, F, W, Y, P, or C; K709 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P,
or C; I710 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; S711 replaced with D, E, H,
K, R, N, Q, F, W, Y, P, or C; G712 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C;
S713 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; V714 replaced with D, E, H, K,
R, N, Q, F, W, Y, P, or C; T715 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; S716 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A717 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; K718 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or 
C; P719 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; G720
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; Y721 replaced with D, E, H, K, R, N,
Q, A, G, I, L, S, T, M, V, P, or C; H722 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F,
W, Y, P, or C; D723 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C;
I724 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; I725 replaced with D, E, H, K, R,
N, Q, F, W, Y, P, or C; T726 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; I727
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; P728 replaced with D, E, H, K, R, A,
G, I, L, S, T, M, V, N, Q, F, W, Y, or C; T729 replaced with D, E, H, K, R, N, Q, F, W, Y,
P, or C; G730 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A731 replaced with D,
E, H, K, R, N, Q, F, W, Y, P, or C; T732 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or
C; N733 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; I734 replaced
with D, E, H, K, R, N, Q, F, W, Y, P, or C; E735 replaced with H, K, R, A, G, I, L, S, T, M,
V, N, Q, F, W, Y, P, or C; V736 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; K737 
replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; Q738 replaced with D,
E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; R739 replaced with D, E, A, G, I, L, S, 
T, M, V, N, Q, F, W, Y, P, or C; N740 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V,
F, W, Y, P, or C; Q741 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or 
C; R742 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; G743 replaced 
with D, E, H, K, R, N, Q, F, W, Y, P, or C; S744 replaced with D, E, H, K, R, N, Q, F, W, 
Y, P, or C; R745 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; N746 
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; N747 replaced with
D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; G748 replaced with D, E, H, K, R, 
N, Q, F, W, Y, P, or C; S749 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; F750 
replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; L751 replaced with D, E,
H, K, R, N, Q, F, W, Y, P, or C; A752 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C;
I753 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; K754 replaced with D, E, A, G,
I, L, S, T, M, V, N, Q, F, W, Y, P, or C; A755 replaced with D, E, H, K, R, N, Q, F, W, Y, 
P, or C; A756 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; D757 replaced with H, 
K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; G758 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; T759 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; Y760 
replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; I761 replaced with D, E,
H, K, R, N, Q, F, W, Y, P, or C; L762 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; 
N763 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; G764 replaced 
with D, E, H, K, R, N, Q, F, W, Y, P, or C; D765 replaced with H, K, R, A, G, I, L, S, T, M, 
V, N, Q, F, W, Y, P, or C; Y766 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, 
P, or C; T767 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L768 replaced with D,
E, H, K, R, N, Q, F, W, Y, P, or C; S769 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or
C; T770 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L771 replaced with D, E, H,
K, R, N, Q, F, W, Y, P, or C; E772 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, 
W, Y, P, or C; Q773 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; 
D774 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; I775 replaced
with D, E, H, K, R, N, Q, F, W, Y, P, or C; M776 replaced with D, E, H, K, R, N, Q, F, W, 
Y, P, or C; Y777 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; K778 
replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; G779 replaced with D, 
E, H, K, R, N, Q, F, W, Y, P, or C; V780 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or 
C; V781 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L782 replaced with D, E, H,
K, R, N, Q, F, W, Y, P, or C; R783 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W,
Y, P, or C; Y784 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; S785 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G786 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; S787 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; S788 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A789 replaced with D, E, H, K, R, N,
Q, F, W, Y, P, or C; A790 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L791 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; E792 replaced with H, K, R, A, G, I, 
L, S, T, M, V, N, Q, F, W, Y, P, or C; R793 replaced with D, E, A, G, I, L, S, T, M, V, N, 
Q, F, W, Y, P, or C; I794 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; R795
replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; S796 replaced with D, E,
H, K, R, N, Q, F, W, Y, P, or C; F797 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, 
M, V, P, or C; S798 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; P799 replaced with 
D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; L800 replaced with D, E, H, K, 
R, N, Q, F, W, Y, P, or C; K801 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y,
P, or C; E802 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; P803
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; L804 replaced with 
D, E, H, K, R, N, Q, F, W, Y, P, or C; T805 replaced with D, E, H, K, R, N, Q, F, W, Y, P, 
or C; I806 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; Q807 replaced with D, E,
H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; V808 replaced with D, E, H, K, R, N, Q,
F, W, Y, P, or C; L809 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; T810 replaced
with D, E, H, K, R, N, Q, F, W, Y, P, or C; V811 replaced with D, E, H, K, R, N, Q, F, W,
Y, P, or C; G812 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; N813 replaced with 
D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; A814 replaced with D, E, H, K, R, 
N, Q, F, W, Y, P, or C; L815 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; R816
replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; P817 replaced with D, E,
H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; K818 replaced with D, E, A, G, I, L,
S, T, M, V, N, Q, F, W, Y, P, or C; I819 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or
C; K820 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; Y821 replaced 
with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; T822 replaced with D, E, H, K, R,
N, Q, F, W, Y, P, or C; Y823 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P,
or C; F824 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; V825 replaced 
with D, E, H, K, R, N, Q, F, W, Y, P, or C; K826 replaced with D, E, A, G, I, L, S, T, M,
V, N, Q, F, W, Y, P, or C; K827 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y,
P, or C; K828 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; K829 
replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; E830 replaced with H, 
K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; S831 replaced with D, E, H, K, R, N,
Q, F, W, Y, P, or C; F832 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or
C; N833 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; A834 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; I835 replaced with D, E, H, K, R, N,
Q, F, W, Y, P, or C; P836 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, 
Y, or C; T837 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; F838 replaced with D,
E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; S839 replaced with D, E, H, K, R, N, Q,
F, W, Y, P, or C; A840 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; W841 replaced 
with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; V842 replaced with D, E, H, K, R, 
N, Q, F, W, Y, P, or C; I843 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; E844
replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; E845 replaced with
H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; W846 replaced with D, E, H, K, R,
N, Q, A, G, I, L, S, T, M, V, P, or C; G847 replaced with D, E, H, K, R, N, Q, F, W, Y, P, 
or C; E848 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; C849 
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; S850 replaced with
D, E, H, K, R, N, Q, F, W, Y, P, or C; K851 replaced with D, E, A, G, I, L, S, T, M, V, N,
Q, F, W, Y, P, or C; S852 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; C853
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; E854 replaced with 
H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; L855 replaced with D, E, H, K, R, 
N, Q, F, W, Y, P, or C; G856 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; W857 
replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; Q858 replaced with D,
E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; R859 replaced with D, E, A, G, I, L, S,
T, M, V, N, Q, F, W, Y, P, or C; R860 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, 
W, Y, P, or C; L861 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; V862 replaced 
with D, E, H, K, R, N, Q, F, W, Y, P, or C; E863 replaced with H, K, R, A, G, I, L, S, T, M, 
V, N, Q, F, W, Y, P, or C; C864 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, 

F, W, Y, or P; R865 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; D866

replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; I867 replaced with D,

E, H, K, R, N, Q, F, W, Y, P, or C; N868 replaced with D, E, H, K, R, A, G, I, L, S, T, M,
V, F, W, Y, P, or C; G869 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; Q870
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; P871 replaced with D,
E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; A872 replaced with D, E, H, K, R, 
N, Q, F, W, Y, P, or C; S873 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; E874 
replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; C875 replaced with

D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; A876 replaced with D, E, H, K, 
R, N, Q, F, W, Y, P, or C; K877 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, 
P, or C; E878 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; V879 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; K880 replaced with D, E, A, G, I, L, 

S, T, M, V, N, Q, F, W, Y, P, or C; P881 replaced with D, E, H, K, R, A, G, I, L, S, T, M,

V, N, Q, F, W, Y, or C; A882 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; S883 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; T884 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; R885 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or 
C; P886 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; C887 

replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; A888 replaced with

D, E, H, K, R, N, Q, F, W, Y, P, or C; D889 replaced with H, K, R, A, G, I, L, S, T, M, V, 
N, Q, F, W, Y, P, or C; H890 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, 
or C; P891 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; C892 
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; P893 replaced with 

D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; Q894 replaced with D, E, H, K,

R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; W895 replaced with D, E, H, K, R, N, Q, A, G, 
I, L, S, T, M, V, P, or C; Q896 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, 
P, or C; L897 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G898 replaced with D, 

E, H, K, R, N, Q, F, W, Y, P, or C; E899 replaced with H, K, R, A, G, I, L, S, T, M, V, N, 
Q, F, W, Y, P, or C; W900 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or

C; S901 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; S902 replaced with D, E, H, 
K, R, N, Q, F, W, Y, P, or C; C903 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, 
Q, F, W, Y, or P; S904 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; K905 replaced 
with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; T906 replaced with D, E, H, K, R, 
N, Q, F, W, Y, P, or C; C907 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F,

W, Y, or P; G908 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; K909 replaced with 
D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; G910 replaced with D, E, H, K, R, N,
Q, F, W, Y, P, or C; Y911 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or
C; K912 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; K913 replaced 
with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; R914 replaced with D, E, A, G, 
I, L, S, T, M, V, N, Q, F, W, Y, P, or C; S915 replaced with D, E, H, K, R, N, Q, F, W, Y, 
P, or C; L916 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; K917 replaced with D,

E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; C918 replaced with D, E, H, K, R, A, G, 
I, L, S, T, M, V, N, Q, F, W, Y, or P; L919 replaced with D, E, H, K, R, N, Q, F, W, Y, P, 
or C; S920 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; H921 replaced with D, E, 
A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; D922 replaced with H, K, R, A, G, I, L, S, T, 

M, V, N, Q, F, W, Y, P, or C; G923 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C;

G924 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; V925 replaced with D, E, H, K, 
R, N, Q, F, W, Y, P, or C; L926 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; S927 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; H928 replaced with D, E, A, G, I, L, 
S, T, M, V, N, Q, F, W, Y, P, or C; E929 replaced with H, K, R, A, G, I, L, S, T, M, V, N, 

Q, F, W, Y, P, or C; S930 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; C931

replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; D932 replaced with 
H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; P933 replaced with D, E, H, K, R, 
A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; L934 replaced with D, E, H, K, R, N, Q, F, W, 
Y, P, or C; K935 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; K936 

replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; P937 replaced with D, E,

H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; K938 replaced with D, E, A, G, I, L, 
S, T, M, V, N, Q, F, W, Y, P, or C; H939 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, 

F, W, Y, P, or C; F940 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; 
I941 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; D942 replaced with H, K, R, A,

G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; F943 replaced with D, E, H, K, R, N, Q, A, G,

I, L, S, T, M, V, P, or C; C944 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, 
W, Y, or P; T945 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; M946 replaced with 
D, E, H, K, R, N, Q, F, W, Y, P, or C; A947 replaced with D, E, H, K, R, N, Q, F, W, Y, P, 
or C; E948 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; C949 

replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; S950 replaced with

D, E, H, K, R, N, Q, F, W, Y, P, or C. 
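
The non-conservative replacements listed above follow a regular pattern: each residue is replaced with any of the other standard amino acids that fall outside its own class. The following Python sketch illustrates that pattern. The five residue classes and the names `GROUPS`, `ALL`, `conservative`, and `non_conservative` are assumptions inferred from the lists in this section; they are not defined in the specification.

```python
# Residue classes inferred from the substitution lists in this section.
# This grouping is an assumption drawn from the observed pattern.
GROUPS = [
    set("AGILSTMV"),  # small, aliphatic, and hydroxyl residues
    set("FWY"),       # aromatic residues
    set("HKR"),       # basic residues
    set("DE"),        # acidic residues
    set("NQ"),        # amide residues
]

# The twenty standard amino acids in one-letter code.
ALL = set("ACDEFGHIKLMNPQRSTVWY")


def conservative(residue):
    """Return the other members of the residue's class (empty for P and C)."""
    for group in GROUPS:
        if residue in group:
            return group - {residue}
    return set()  # P and C belong to no class in the lists


def non_conservative(residue):
    """Return every standard amino acid outside the residue's class."""
    return ALL - conservative(residue) - {residue}


if __name__ == "__main__":
    # Reproduce the entry for T837 in the list above.
    print("T837 replaced with", ", ".join(sorted(non_conservative("T"))))
```

Under these assumptions, `non_conservative("T")` yields the same twelve residues listed for T837, and `conservative("M")` yields the seven residues listed for M1 of METH2.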

Also preferred are METH2 polypeptides with one or more of the following
conservative amino acid substitutions: Ml replaced with A, G, I, L, S, T, or V; F2 replaced 
with W, or Y; A4 replaced with G, I, L, S, T, M, or V; A6 replaced with G, I, L, S, T, M, or 
V; A7 replaced with G, I, L, S, T, M, or V; R9 replaced with H, or K; W10 replaced with
F, or Y; L11 replaced with A, G, I, S, T, M, or V; F13 replaced with W, or Y; L14 replaced

with A, G, I, S, T, M, or V; L15 replaced with A, G, I, S, T, M, or V; L16 replaced with A, 
G, I, S, T, M, or V; L17 replaced with A, G, I, S, T, M, or V; L18 replaced with A, G, I, S, 
T, M, or V; L19 replaced with A, G, I, S, T, M, or V; L20 replaced with A, G, I, S, T, M, or 
V; L21 replaced with A, G, I, S, T, M, or V; L22 replaced with A, G, I, S, T, M, or V; L24 

replaced with A, G, I, S, T, M, or V; A25 replaced with G, I, L, S, T, M, or V; R26 replaced

with H, or K; G27 replaced with A, I, L, S, T, M, or V; A28 replaced with G, I, L, S, T, M, 
or V; A30 replaced with G, I, L, S, T, M, or V; R31 replaced with H, or K; A33 replaced
with G, I, L, S, T, M, or V; A34 replaced with G, I, L, S, T, M, or V; G35 replaced with A,
I, L, S, T, M, or V; G36 replaced with A, I, L, S, T, M, or V; Q37 replaced with N; A38

replaced with G, I, L, S, T, M, or V; S39 replaced with A, G, I, L, T, M, or V; E40 replaced

with D; L41 replaced with A, G, I, S, T, M, or V; V42 replaced with A, G, I, L, S, T, or M; 
V43 replaced with A, G, I, L, S, T, or M; T45 replaced with A, G, I, L, S, M, or V; R46 
replaced with H, or K; L47 replaced with A, G, I, S, T, M, or V; G49 replaced with A, I, L, 
S, T, M, or V; S50 replaced with A, G, I, L, T, M, or V; A51 replaced with G, I, L, S, T, M, 

or V; G52 replaced with A, I, L, S, T, M, or V; E53 replaced with D; L54 replaced with A,

G, I, S, T, M, or V; A55 replaced with G, I, L, S, T, M, or V; L56 replaced with A, G, I, S, 
T, M, or V; H57 replaced with K, or R; L58 replaced with A, G, I, S, T, M, or V; S59 
replaced with A, G, I, L, T, M, or V; A60 replaced with G, I, L, S, T, M, or V; F61 replaced 
with W, or Y; G62 replaced with A, I, L, S, T, M, or V; K63 replaced with H, or R; G64

replaced with A, I, L, S, T, M, or V; F65 replaced with W, or Y; V66 replaced with A, G,

I, L, S, T, or M; L67 replaced with A, G, I, S, T, M, or V; R68 replaced with H, or K; L69 
replaced with A, G, I, S, T, M, or V; A70 replaced with G, I, L, S, T, M, or V; D72 replaced 
with E; D73 replaced with E; S74 replaced with A, G, I, L, T, M, or V; F75 replaced with
W, or Y; L76 replaced with A, G, I, S, T, M, or V; A77 replaced with G, I, L, S, T, M, or 

V; E79 replaced with D; F80 replaced with W, or Y; K81 replaced with H, or R; I82

replaced with A, G, L, S, T, M, or V; E83 replaced with D; R84 replaced with H, or K; L85 
replaced with A, G, I, S, T, M, or V; G86 replaced with A, I, L, S, T, M, or V; G87 replaced
with A, I, L, S, T, M, or V; S88 replaced with A, G, I, L, T, M, or V; G89 replaced with A,
I, L, S, T, M, or V; R90 replaced with H, or K; A91 replaced with G, I, L, S, T, M, or V; T92 
replaced with A, G, I, L, S, M, or V; G93 replaced with A, I, L, S, T, M, or V; G94 replaced 
with A, I, L, S, T, M, or V; E95 replaced with D; R96 replaced with H, or K; G97 replaced 
with A, I, L, S, T, M, or V; L98 replaced with A, G, I, S, T, M, or V; R99 replaced with H,

or K; G100 replaced with A, I, L, S, T, M, or V; F102 replaced with W, or Y; F103 replaced
with W, or Y; S104 replaced with A, G, I, L, T, M, or V; G105 replaced with A, I, L, S, T,
M, or V; T106 replaced with A, G, I, L, S, M, or V; V107 replaced with A, G, I, L, S, T, or
M; N108 replaced with Q; G109 replaced with A, I, L, S, T, M, or V; E110 replaced with

D; E112 replaced with D; S113 replaced with A, G, I, L, T, M, or V; L114 replaced with A,

G, I, S, T, M, or V; A115 replaced with G, I, L, S, T, M, or V; A116 replaced with G, I, L,
S, T, M, or V; V117 replaced with A, G, I, L, S, T, or M; S118 replaced with A, G, I, L, T,
M, or V; L119 replaced with A, G, I, S, T, M, or V; R121 replaced with H, or K; G122
replaced with A, I, L, S, T, M, or V; L123 replaced with A, G, I, S, T, M, or V; S124 

replaced with A, G, I, L, T, M, or V; G125 replaced with A, I, L, S, T, M, or V; S126

replaced with A, G, I, L, T, M, or V; F127 replaced with W, or Y; L128 replaced with A, G, 
I, S, T, M, or V; L129 replaced with A, G, I, S, T, M, or V; D130 replaced with E; G131 
replaced with A, I, L, S, T, M, or V; E132 replaced with D; E133 replaced with D; F134 
replaced with W, or Y; T135 replaced with A, G, I, L, S, M, or V; I136 replaced with A, G,

L, S, T, M, or V; Q137 replaced with N; Q139 replaced with N; G140 replaced with A, I,

L, S, T, M, or V; A141 replaced with G, I, L, S, T, M, or V; G142 replaced with A, I, L, S, 
T, M, or V; G143 replaced with A, I, L, S, T, M, or V; S144 replaced with A, G, I, L, T, M,
or V; L145 replaced with A, G, I, S, T, M, or V; A146 replaced with G, I, L, S, T, M, or V;
Q147 replaced with N; H149 replaced with K, or R; R150 replaced with H, or K; L151 

replaced with A, G, I, S, T, M, or V; Q152 replaced with N; R153 replaced with H, or K;

W154 replaced with F, or Y; G155 replaced with A, I, L, S, T, M, or V; A157 replaced with
G, I, L, S, T, M, or V; G158 replaced with A, I, L, S, T, M, or V; A159 replaced with G, I, 
L, S, T, M, or V; R160 replaced with H, or K; L162 replaced with A, G, I, S, T, M, or V; 
R164 replaced with H, or K; G165 replaced with A, I, L, S, T, M, or V; E167 replaced with 

D; W168 replaced with F, or Y; E169 replaced with D; V170 replaced with A, G, I, L, S, T,

or M; E171 replaced with D; T172 replaced with A, G, I, L, S, M, or V; G173 replaced with
A, I, L, S, T, M, or V; E174 replaced with D; G175 replaced with A, I, L, S, T, M, or V;






Q176 replaced with N; R177 replaced with H, or K; Q178 replaced with N; E179 replaced 
with D; R180 replaced with H, or K; G181 replaced with A, I, L, S, T, M, or V; D182 
replaced with E; H183 replaced with K, or R; Q184 replaced with N; E185 replaced with 
D; D186 replaced with E; S187 replaced with A, G, I, L, T, M, or V; E188 replaced with D;
E189 replaced with D; E190 replaced with D; S191 replaced with A, G, I, L, T, M, or V;

Q192 replaced with N; E193 replaced with D; E194 replaced with D; E195 replaced with 
D; A196 replaced with G, I, L, S, T, M, or V; E197 replaced with D; G198 replaced with A, 
I, L, S, T, M, or V; A199 replaced with G, I, L, S, T, M, or V; S200 replaced with A, G, I, 
L, T, M, or V; E201 replaced with D; L206 replaced with A, G, I, S, T, M, or V; G207 

replaced with A, I, L, S, T, M, or V; A208 replaced with G, I, L, S, T, M, or V; T209

replaced with A, G, I, L, S, M, or V; S210 replaced with A, G, I, L, T, M, or V; R211
replaced with H, or K; T212 replaced with A, G, I, L, S, M, or V; K213 replaced with H, or
R; R214 replaced with H, or K; F215 replaced with W, or Y; V216 replaced with A, G, I, 
L, S, T, or M; S217 replaced with A, G, I, L, T, M, or V; E218 replaced with D; A219 

replaced with G, I, L, S, T, M, or V; R220 replaced with H, or K; F221 replaced with W, or

Y; V222 replaced with A, G, I, L, S, T, or M; E223 replaced with D; T224 replaced with A, 
G, I, L, S, M, or V; L225 replaced with A, G, I, S, T, M, or V; L226 replaced with A, G, I, 
S, T, M, or V; V227 replaced with A, G, I, L, S, T, or M; A228 replaced with G, I, L, S, T, 
M, or V; D229 replaced with E; A230 replaced with G, I, L, S, T, M, or V; S231 replaced 

with A, G, I, L, T, M, or V; M232 replaced with A, G, I, L, S, T, or V; A233 replaced with

G, I, L, S, T, M, or V; A234 replaced with G, I, L, S, T, M, or V; F235 replaced with W, or
Y; Y236 replaced with F, or W; G237 replaced with A, I, L, S, T, M, or V; A238 replaced 
with G, I, L, S, T, M, or V; D239 replaced with E; L240 replaced with A, G, I, S, T, M, or
V; Q241 replaced with N; N242 replaced with Q; H243 replaced with K, or R; I244 replaced

with A, G, L, S, T, M, or V; L245 replaced with A, G, I, S, T, M, or V; T246 replaced with

A, G, I, L, S, M, or V; L247 replaced with A, G, I, S, T, M, or V; M248 replaced with A, G, 
I, L, S, T, or V; S249 replaced with A, G, I, L, T, M, or V; V250 replaced with A, G, I, L, 
S, T, or M; A251 replaced with G, I, L, S, T, M, or V; A252 replaced with G, I, L, S, T, M,
or V; R253 replaced with H, or K; I254 replaced with A, G, L, S, T, M, or V; Y255 replaced

with F, or W; K256 replaced with H, or R; H257 replaced with K, or R; S259 replaced with

A, G, I, L, T, M, or V; I260 replaced with A, G, L, S, T, M, or V; K261 replaced with H, or
R; N262 replaced with Q; S263 replaced with A, G, I, L, T, M, or V; I264 replaced with A,
G, L, S, T, M, or V; N265 replaced with Q; L266 replaced with A, G, I, S, T, M, or V; M267
replaced with A, G, I, L, S, T, or V; V268 replaced with A, G, I, L, S, T, or M; V269 
replaced with A, G, I, L, S, T, or M; K270 replaced with H, or R; V271 replaced with A, G, 
I, L, S, T, or M; L272 replaced with A, G, I, S, T, M, or V; I273 replaced with A, G, L, S,

T, M, or V; V274 replaced with A, G, I, L, S, T, or M; E275 replaced with D; D276 replaced

with E; E277 replaced with D; K278 replaced with H, or R; W279 replaced with F, or Y; 
G280 replaced with A, I, L, S, T, M, or V; E282 replaced with D; V283 replaced with A, G, 
I, L, S, T, or M; S284 replaced with A, G, I, L, T, M, or V; D285 replaced with E; N286 
replaced with Q; G287 replaced with A, I, L, S, T, M, or V; G288 replaced with A, I, L, S, 

T, M, or V; L289 replaced with A, G, I, S, T, M, or V; T290 replaced with A, G, I, L, S, M,

or V; L291 replaced with A, G, I, S, T, M, or V; R292 replaced with H, or K; N293 replaced
with Q; F294 replaced with W, or Y; N296 replaced with Q; W297 replaced with F, or Y; 
Q298 replaced with N; R299 replaced with H, or K; R300 replaced with H, or K; F301 
replaced with W, or Y; N302 replaced with Q; Q303 replaced with N; S305 replaced with 

A, G, I, L, T, M, or V; D306 replaced with E; R307 replaced with H, or K; H308 replaced

with K, or R; E310 replaced with D; H311 replaced with K, or R; Y312 replaced with F, or
W; D313 replaced with E; T314 replaced with A, G, I, L, S, M, or V; A315 replaced with
G, I, L, S, T, M, or V; I316 replaced with A, G, L, S, T, M, or V; L317 replaced with A, G,
I, S, T, M, or V; L318 replaced with A, G, I, S, T, M, or V; T319 replaced with A, G, I, L, 

S, M, or V; R320 replaced with H, or K; Q321 replaced with N; N322 replaced with Q;

F323 replaced with W, or Y; G325 replaced with A, I, L, S, T, M, or V; Q326 replaced with 
N; E327 replaced with D; G328 replaced with A, I, L, S, T, M, or V; L329 replaced with A, 
G, I, S, T, M, or V; D331 replaced with E; T332 replaced with A, G, I, L, S, M, or V; L333 
replaced with A, G, I, S, T, M, or V; G334 replaced with A, I, L, S, T, M, or V; V335 

replaced with A, G, I, L, S, T, or M; A336 replaced with G, I, L, S, T, M, or V; D337

replaced with E; I338 replaced with A, G, L, S, T, M, or V; G339 replaced with A, I, L, S,
T, M, or V; T340 replaced with A, G, I, L, S, M, or V; I341 replaced with A, G, L, S, T, M,
or V; D343 replaced with E; N345 replaced with Q; K346 replaced with H, or R; S347 
replaced with A, G, I, L, T, M, or V; S349 replaced with A, G, I, L, T, M, or V; V350 

replaced with A, G, I, L, S, T, or M; I351 replaced with A, G, L, S, T, M, or V; E352

replaced with D; D353 replaced with E; E354 replaced with D; G355 replaced with A, I, L, 
S, T, M, or V; L356 replaced with A, G, I, S, T, M, or V; Q357 replaced with N; A358 






replaced with G, I, L, S, T, M, or V; A359 replaced with G, I, L, S, T, M, or V; H360 
replaced with K, or R; T361 replaced with A, G, I, L, S, M, or V; L362 replaced with A, G, 
I, S, T, M, or V; A363 replaced with G, I, L, S, T, M, or V; H364 replaced with K, or R; 
E365 replaced with D; L366 replaced with A, G, I, S, T, M, or V; G367 replaced with A, I, 
L, S, T, M, or V; H368 replaced with K, or R; V369 replaced with A, G, I, L, S, T, or M;

L370 replaced with A, G, I, S, T, M, or V; S371 replaced with A, G, I, L, T, M, or V; M372 
replaced with A, G, I, L, S, T, or V; H374 replaced with K, or R; D375 replaced with E; 
D376 replaced with E; S377 replaced with A, G, I, L, T, M, or V; K378 replaced with H, or 
R; T381 replaced with A, G, I, L, S, M, or V; R382 replaced with H, or K; L383 replaced 

with A, G, I, S, T, M, or V; F384 replaced with W, or Y; G385 replaced with A, I, L, S, T,

M, or V; M387 replaced with A, G, I, L, S, T, or V; G388 replaced with A, I, L, S, T, M, or 
V; K389 replaced with H, or R; H390 replaced with K, or R; H391 replaced with K, or R; 
V392 replaced with A, G, I, L, S, T, or M; M393 replaced with A, G, I, L, S, T, or V; A394 
replaced with G, I, L, S, T, M, or V; L396 replaced with A, G, I, S, T, M, or V; F397 

replaced with W, or Y; V398 replaced with A, G, I, L, S, T, or M; H399 replaced with K,

or R; L400 replaced with A, G, I, S, T, M, or V; N401 replaced with Q; Q402 replaced with 
N; T403 replaced with A, G, I, L, S, M, or V; L404 replaced with A, G, I, S, T, M, or V; 
W406 replaced with F, or Y; S407 replaced with A, G, I, L, T, M, or V; S410 replaced with 
A, G, I, L, T, M, or V; A411 replaced with G, I, L, S, T, M, or V; M412 replaced with A,

G, I, L, S, T, or V; Y413 replaced with F, or W; L414 replaced with A, G, I, S, T, M, or V;

T415 replaced with A, G, I, L, S, M, or V; E416 replaced with D; L417 replaced with A, G, 
I, S, T, M, or V; L418 replaced with A, G, I, S, T, M, or V; D419 replaced with E; G420 
replaced with A, I, L, S, T, M, or V; G421 replaced with A, I, L, S, T, M, or V; H422 
replaced with K, or R; G423 replaced with A, I, L, S, T, M, or V; D424 replaced with E; 

L426 replaced with A, G, I, S, T, M, or V; L427 replaced with A, G, I, S, T, M, or V; D428

replaced with E; A429 replaced with G, I, L, S, T, M, or V; G431 replaced with A, I, L, S,
T, M, or V; A432 replaced with G, I, L, S, T, M, or V; A433 replaced with G, I, L, S, T, M, 
or V; L434 replaced with A, G, I, S, T, M, or V; L436 replaced with A, G, I, S, T, M, or V; 
T438 replaced with A, G, I, L, S, M, or V; G439 replaced with A, I, L, S, T, M, or V; L440 

replaced with A, G, I, S, T, M, or V; G442 replaced with A, I, L, S, T, M, or V; R443

replaced with H, or K; M444 replaced with A, G, I, L, S, T, or V; A445 replaced with G, I, 
L, S, T, M, or V; L446 replaced with A, G, I, S, T, M, or V; Y447 replaced with F, or W;
Q448 replaced with N; L449 replaced with A, G, I, S, T, M, or V; D450 replaced with E;
Q451 replaced with N; Q452 replaced with N; R454 replaced with H, or K; Q455 replaced 
with N; I456 replaced with A, G, L, S, T, M, or V; F457 replaced with W, or Y; G458
replaced with A, I, L, S, T, M, or V; D460 replaced with E; F461 replaced with W, or Y; 
R462 replaced with H, or K; H463 replaced with K, or R; N466 replaced with Q; T467

replaced with A, G, I, L, S, M, or V; S468 replaced with A, G, I, L, T, M, or V; A469 
replaced with G, I, L, S, T, M, or V; Q470 replaced with N; D471 replaced with E; V472 
replaced with A, G, I, L, S, T, or M; A474 replaced with G, I, L, S, T, M, or V; Q475
replaced with N; L476 replaced with A, G, I, S, T, M, or V; W477 replaced with F, or Y; 

H479 replaced with K, or R; T480 replaced with A, G, I, L, S, M, or V; D481 replaced with

E; G482 replaced with A, I, L, S, T, M, or V; A483 replaced with G, I, L, S, T, M, or V; 
E484 replaced with D; L486 replaced with A, G, I, S, T, M, or V; H488 replaced with K, or
R; T489 replaced with A, G, I, L, S, M, or V; K490 replaced with H, or R; N491 replaced 
with Q; G492 replaced with A, I, L, S, T, M, or V; S493 replaced with A, G, I, L, T, M, or 

V; L494 replaced with A, G, I, S, T, M, or V; W496 replaced with F, or Y; A497 replaced

with G, I, L, S, T, M, or V; D498 replaced with E; G499 replaced with A, I, L, S, T, M, or 
V; T500 replaced with A, G, I, L, S, M, or V; G503 replaced with A, I, L, S, T, M, or V; 
G505 replaced with A, I, L, S, T, M, or V; H506 replaced with K, or R; L507 replaced with 
A, G, I, S, T, M, or V; S509 replaced with A, G, I, L, T, M, or V; E510 replaced with D; 

G511 replaced with A, I, L, S, T, M, or V; S512 replaced with A, G, I, L, T, M, or V; L514

replaced with A, G, I, S, T, M, or V; E516 replaced with D; E517 replaced with D; E518 
replaced with D; V519 replaced with A, G, I, L, S, T, or M; E520 replaced with D; R521 
replaced with H, or K; K523 replaced with H, or R; V525 replaced with A, G, I, L, S, T, or 
M; V526 replaced with A, G, I, L, S, T, or M; D527 replaced with E; G528 replaced with 

A, I, L, S, T, M, or V; G529 replaced with A, I, L, S, T, M, or V; W530 replaced with F, or

Y; A531 replaced with G, I, L, S, T, M, or V; W533 replaced with F, or Y; G534 replaced
with A, I, L, S, T, M, or V; W536 replaced with F, or Y; G537 replaced with A, I, L, S, T, 
M, or V; E538 replaced with D; S540 replaced with A, G, I, L, T, M, or V; R541 replaced 
with H, or K; T542 replaced with A, G, I, L, S, M, or V; G544 replaced with A, I, L, S, T,

M, or V; G545 replaced with A, I, L, S, T, M, or V; G546 replaced with A, I, L, S, T, M, or

V; V547 replaced with A, G, I, L, S, T, or M; Q548 replaced with N; F549 replaced with W, 
or Y; S550 replaced with A, G, I, L, T, M, or V; H551 replaced with K, or R; R552 replaced
with H, or K; E553 replaced with D; K555 replaced with H, or R; D556 replaced with E;
E558 replaced with D; Q560 replaced with N; N561 replaced with Q; G562 replaced with 
A, I, L, S, T, M, or V; G563 replaced with A, I, L, S, T, M, or V; R564 replaced with H, or 
K; Y565 replaced with F, or W; L567 replaced with A, G, I, S, T, M, or V; G568 replaced 
with A, I, L, S, T, M, or V; R569 replaced with H, or K; R570 replaced with H, or K; A571

replaced with G, I, L, S, T, M, or V; K572 replaced with H, or R; Y573 replaced with F, or 
W; Q574 replaced with N; S575 replaced with A, G, I, L, T, M, or V; H577 replaced with 
K, or R; T578 replaced with A, G, I, L, S, M, or V; E579 replaced with D; E580 replaced 
with D; D584 replaced with E; G585 replaced with A, I, L, S, T, M, or V; K586 replaced

with H, or R; S587 replaced with A, G, I, L, T, M, or V; F588 replaced with W, or Y; R589

replaced with H, or K; E590 replaced with D; Q591 replaced with N; Q592 replaced with 
N; E594 replaced with D; K595 replaced with H, or R; Y596 replaced with F, or W; N597 
replaced with Q; A598 replaced with G, I, L, S, T, M, or V; Y599 replaced with F, or W; 
N600 replaced with Q; Y601 replaced with F, or W; T602 replaced with A, G, I, L, S, M, 

or V; D603 replaced with E; M604 replaced with A, G, I, L, S, T, or V; D605 replaced with

E; G606 replaced with A, I, L, S, T, M, or V; N607 replaced with Q; L608 replaced with A, 
G, I, S, T, M, or V; L609 replaced with A, G, I, S, T, M, or V; Q610 replaced with N; W611
replaced with F, or Y; V612 replaced with A, G, I, L, S, T, or M; K614 replaced with H, or 
R; Y615 replaced with F, or W; A616 replaced with G, I, L, S, T, M, or V; G617 replaced 

with A, I, L, S, T, M, or V; V618 replaced with A, G, I, L, S, T, or M; S619 replaced with

A, G, I, L, T, M, or V; R621 replaced with H, or K; D622 replaced with E; R623 replaced 
with H, or K; K625 replaced with H, or R; L626 replaced with A, G, I, S, T, M, or V; F627 
replaced with W, or Y; R629 replaced with H, or K; A630 replaced with G, I, L, S, T, M,
or V; R631 replaced with H, or K; G632 replaced with A, I, L, S, T, M, or V; R633 replaced

with H, or K; S634 replaced with A, G, I, L, T, M, or V; E635 replaced with D; F636

replaced with W, or Y; K637 replaced with H, or R; V638 replaced with A, G, I, L, S, T, or 
M; F639 replaced with W, or Y; E640 replaced with D; A641 replaced with G, I, L, S, T, 
M, or V; K642 replaced with H, or R; V643 replaced with A, G, I, L, S, T, or M; I644
replaced with A, G, L, S, T, M, or V; D645 replaced with E; G646 replaced with A, I, L, S, 

T, M, or V; T647 replaced with A, G, I, L, S, M, or V; L648 replaced with A, G, I, S, T, M,

or V; G650 replaced with A, I, L, S, T, M, or V; E652 replaced with D; T653 replaced with 
A, G, I, L, S, M, or V; L654 replaced with A, G, I, S, T, M, or V; A655 replaced with G, I,
L, S, T, M, or V; I656 replaced with A, G, L, S, T, M, or V; V658 replaced with A, G, I, L,
S, T, or M; R659 replaced with H, or K; G660 replaced with A, I, L, S, T, M, or V; Q661 
replaced with N; V663 replaced with A, G, I, L, S, T, or M; K664 replaced with H, or R; 
A665 replaced with G, I, L, S, T, M, or V; G666 replaced with A, I, L, S, T, M, or V; D668 
replaced with E; H669 replaced with K, or R; V670 replaced with A, G, I, L, S, T, or M;

V671 replaced with A, G, I, L, S, T, or M; D672 replaced with E; S673 replaced with A, G, 
I, L, T, M, or V; R675 replaced with H, or K; K676 replaced with H, or R; L677 replaced 
with A, G, I, S, T, M, or V; D678 replaced with E; K679 replaced with H, or R; G681 
replaced with A, I, L, S, T, M, or V; V682 replaced with A, G, I, L, S, T, or M; G684 
replaced with A, I, L, S, T, M, or V; G685 replaced with A, I, L, S, T, M, or V; K686

replaced with H, or R; G687 replaced with A, I, L, S, T, M, or V; N688 replaced with Q; 
S689 replaced with A, G, I, L, T, M, or V; R691 replaced with H, or K; K692 replaced with 

H, or R; V693 replaced with A, G, I, L, S, T, or M; S694 replaced with A, G, I, L, T, M, or 
V; G695 replaced with A, I, L, S, T, M, or V; S696 replaced with A, G, I, L, T, M, or V; 

L697 replaced with A, G, I, S, T, M, or V; T698 replaced with A, G, I, L, S, M, or V; T700

replaced with A, G, I, L, S, M, or V; N701 replaced with Q; Y702 replaced with F, or W; 
G703 replaced with A, I, L, S, T, M, or V; Y704 replaced with F, or W; N705 replaced with 
Q; D706 replaced with E; I707 replaced with A, G, L, S, T, M, or V; V708 replaced with A,
G, I, L, S, T, or M; T709 replaced with A, G, I, L, S, M, or V; I710 replaced with A, G, L,

S, T, M, or V; A712 replaced with G, I, L, S, T, M, or V; G713 replaced with A, I, L, S, T,

M, or V; A714 replaced with G, I, L, S, T, M, or V; T715 replaced with A, G, I, L, S, M, or 
V; N716 replaced with Q; I717 replaced with A, G, L, S, T, M, or V; D718 replaced with
E; V719 replaced with A, G, I, L, S, T, or M; K720 replaced with H, or R; Q721 replaced 
with N; R722 replaced with H, or K; S723 replaced with A, G, I, L, T, M, or V; H724 

replaced with K, or R; G726 replaced with A, I, L, S, T, M, or V; V727 replaced with A, G,

I, L, S, T, or M; Q728 replaced with N; N729 replaced with Q; D730 replaced with E; G731
replaced with A, I, L, S, T, M, or V; N732 replaced with Q; Y733 replaced with F, or W; 
L734 replaced with A, G, I, S, T, M, or V; A735 replaced with G, I, L, S, T, M, or V; L736 
replaced with A, G, I, S, T, M, or V; K737 replaced with H, or R; T738 replaced with A, G, 

I, L, S, M, or V; A739 replaced with G, I, L, S, T, M, or V; D740 replaced with E; G741

replaced with A, I, L, S, T, M, or V; Q742 replaced with N; Y743 replaced with F, or W; 
L744 replaced with A, G, I, S, T, M, or V; L745 replaced with A, G, I, S, T, M, or V; N746
replaced with Q; G747 replaced with A, I, L, S, T, M, or V; N748 replaced with Q; L749
replaced with A, G, I, S, T, M, or V; A750 replaced with G, I, L, S, T, M, or V; I751
replaced with A, G, L, S, T, M, or V; S752 replaced with A, G, I, L, T, M, or V; A753 
replaced with G, I, L, S, T, M, or V; I754 replaced with A, G, L, S, T, M, or V; E755
replaced with D; Q756 replaced with N; D757 replaced with E; I758 replaced with A, G, L,

S, T, M, or V; L759 replaced with A, G, I, S, T, M, or V; V760 replaced with A, G, I, L, S, 
T, or M; K761 replaced with H, or R; G762 replaced with A, I, L, S, T, M, or V; T763 
replaced with A, G, I, L, S, M, or V; I764 replaced with A, G, L, S, T, M, or V; L765
replaced with A, G, I, S, T, M, or V; K766 replaced with H, or R; Y767 replaced with F, or 

W; S768 replaced with A, G, I, L, T, M, or V; G769 replaced with A, I, L, S, T, M, or V;

S770 replaced with A, G, I, L, T, M, or V; I771 replaced with A, G, L, S, T, M, or V; A772
replaced with G, I, L, S, T, M, or V; T773 replaced with A, G, I, L, S, M, or V; L774 
replaced with A, G, I, S, T, M, or V; E775 replaced with D; R776 replaced with H, or K; 
L777 replaced with A, G, I, S, T, M, or V; Q778 replaced with N; S779 replaced with A, G, 

I, L, T, M, or V; F780 replaced with W, or Y; R781 replaced with H, or K; L783 replaced

with A, G, I, S, T, M, or V; E785 replaced with D; L787 replaced with A, G, I, S, T, M, or 
V; T788 replaced with A, G, I, L, S, M, or V; V789 replaced with A, G, I, L, S, T, or M; 
Q790 replaced with N; L791 replaced with A, G, I, S, T, M, or V; L792 replaced with A, G, 
I, S, T, M, or V; T793 replaced with A, G, I, L, S, M, or V; V794 replaced with A, G, I, L, 

S, T, or M; G796 replaced with A, I, L, S, T, M, or V; E797 replaced with D; V798 replaced

with A, G, I, L, S, T, or M; F799 replaced with W, or Y; K802 replaced with H, or R; V803 
replaced with A, G, I, L, S, T, or M; K804 replaced with H, or R; Y805 replaced with F, or 
W; T806 replaced with A, G, I, L, S, M, or V; F807 replaced with W, or Y; F808 replaced 
with W, or Y; V809 replaced with A, G, I, L, S, T, or M; N811 replaced with Q; D812 

replaced with E; V813 replaced with A, G, I, L, S, T, or M; D814 replaced with E; F815

replaced with W, or Y; S816 replaced with A, G, I, L, T, M, or V; M817 replaced with A, 
G, I, L, S, T, or V; Q818 replaced with N; S819 replaced with A, G, I, L, T, M, or V; S820
replaced with A, G, I, L, T, M, or V; K821 replaced with H, or R; E822 replaced with D; 
R823 replaced with H, or K; A824 replaced with G, I, L, S, T, M, or V; T825 replaced with 

A, G, I, L, S, M, or V; T826 replaced with A, G, I, L, S, M, or V; N827 replaced with Q;
I828 replaced with A, G, L, S, T, M, or V; I829 replaced with A, G, L, S, T, M, or V; Q830
replaced with N; L832 replaced with A, G, I, S, T, M, or V; L833 replaced with A, G, I, S, 
T, M, or V; H834 replaced with K, or R; A835 replaced with G, I, L, S, T, M, or V; Q836
replaced with N; W837 replaced with F, or Y; V838 replaced with A, G, I, L, S, T, or M; 
L839 replaced with A, G, I, S, T, M, or V; G840 replaced with A, I, L, S, T, M, or V; D841 
replaced with E; W842 replaced with F, or Y; S843 replaced with A, G, I, L, T, M, or V; 
E844 replaced with D; S846 replaced with A, G, I, L, T, M, or V; S847 replaced with A, G,

I, L, T, M, or V; T848 replaced with A, G, I, L, S, M, or V; G850 replaced with A, I, L, S, 
T, M, or V; A851 replaced with G, I, L, S, T, M, or V; G852 replaced with A, I, L, S, T, M,
or V; W853 replaced with F, or Y; Q854 replaced with N; R855 replaced with H, or K; 
R856 replaced with H, or K; T857 replaced with A, G, I, L, S, M, or V; V858 replaced with 

A, G, I, L, S, T, or M; E859 replaced with D; R861 replaced with H, or K; D862 replaced

with E; S864 replaced with A, G, I, L, T, M, or V; G865 replaced with A, I, L, S, T, M, or 
V; Q866 replaced with N; A867 replaced with G, I, L, S, T, M, or V; S868 replaced with A, 
G, I, L, T, M, or V; A869 replaced with G, I, L, S, T, M, or V; T870 replaced with A, G, I, 
L, S, M, or V; N872 replaced with Q; K873 replaced with H, or R; A874 replaced with G, 

I, L, S, T, M, or V; L875 replaced with A, G, I, S, T, M, or V; K876 replaced with H, or R;

E878 replaced with D; D879 replaced with E; A880 replaced with G, I, L, S, T, M, or V; 
K881 replaced with H, or R; E884 replaced with D; S885 replaced with A, G, I, L, T, M, or 
V; Q886 replaced with N; L887 replaced with A, G, I, S, T, M, or V; L890 replaced with A, 

G, I, S, T, M, or V. 
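
The entries in the list above appear to follow a fixed grouping of residues, with each residue replaceable by the other members of its group. The following is a minimal sketch of that pattern, assuming five groups inferred from the listed entries (the groups are not defined in the text at this point, and the names `CONSERVATIVE_GROUPS`, `conservative_substitutions`, and `describe` are hypothetical). Proline and cysteine are left ungrouped here because no P or C entry appears in the list.

```python
# Groups inferred from the listed entries; this is an assumption, not a
# definition taken from the specification.
CONSERVATIVE_GROUPS = [
    "AGILSTMV",  # A, G, I, L, S, T, M, V
    "DE",        # acidic
    "NQ",        # amide
    "HKR",       # basic
    "FWY",       # aromatic
]


def conservative_substitutions(residue):
    """Return the other members of the residue's group, in group order."""
    for group in CONSERVATIVE_GROUPS:
        if residue in group:
            return [aa for aa in group if aa != residue]
    return []  # P and C: no substitution listed


def describe(residue, position):
    """Format one entry in the style used by the list, or None if ungrouped."""
    subs = conservative_substitutions(residue)
    if not subs:
        return None
    if len(subs) == 1:
        listing = subs[0]
    else:
        listing = ", ".join(subs[:-1]) + ", or " + subs[-1]
    return f"{residue}{position} replaced with {listing}"


if __name__ == "__main__":
    print(describe("G", 747))
    print(describe("E", 755))
    print(describe("K", 761))
```

Under these assumptions, `describe("G", 747)` reproduces the entry "G747 replaced with A, I, L, S, T, M, or V", which gives a quick way to check individual entries of the list for transcription errors.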

Also preferred are METH2 polypeptides with one or more of the following

non-conservative amino acid substitutions: M1 replaced with D, E, H, K, R, N, Q, F, W, Y, P,
or C; F2 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; P3 replaced with 
D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; A4 replaced with D, E, H, K, R, 
N, Q, F, W, Y, P, or C; P5 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, 

Y, or C; A6 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A7 replaced with D, E, H,

K, R, N, Q, F, W, Y, P, or C; P8 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, 
F, W, Y, or C; R9 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; W10
replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; L11 replaced with D, E,

H, K, R, N, Q, F, W, Y, P, or C; P12 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, 
Q, F, W, Y, or C; F13 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; L14
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L15 replaced with D, E, H, K, R, N, Q,
F, W, Y, P, or C; L16 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L17 replaced 
with D, E, H, K, R, N, Q, F, W, Y, P, or C; L18 replaced with D, E, H, K, R, N, Q, F, W,
Y, P, or C; L19 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L20 replaced with D, 
E, H, K, R, N, Q, F, W, Y, P, or C; L21 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or 
C; L22 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; P23 replaced with D, E, H, K, 
R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; L24 replaced with D, E, H, K, R, N, Q, F, W,

Y, P, or C; A25 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; R26 replaced with D, 
E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; G27 replaced with D, E, H, K, R, N, Q, F, 
W, Y, P, or C; A28 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; P29 replaced with 

D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; A30 replaced with D, E, H, K, 
R, N, Q, F, W, Y, P, or C; R31 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y,

P, or C; P32 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; A33 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A34 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; G35 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G36 replaced 
with D, E, H, K, R, N, Q, F, W, Y, P, or C; Q37 replaced with D, E, H, K, R, A, G, I, L, S, 

T, M, V, F, W, Y, P, or C; A38 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; S39

replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; E40 replaced with H, K, R, A, G, I, L, 
S, T, M, V, N, Q, F, W, Y, P, or C; L41 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or 
C; V42 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; V43 replaced with D, E, H, K, 
R, N, Q, F, W, Y, P, or C; P44 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, 

W, Y, or C; T45 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; R46 replaced with D,

E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; L47 replaced with D, E, H, K, R, N, Q, F, 
W, Y, P, or C; P48 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; 
G49 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; S50 replaced with D, E, H, K, R, 
N, Q, F, W, Y, P, or C; A51 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G52 

replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; E53 replaced with H, K, R, A, G, I, L,

S, T, M, V, N, Q, F, W, Y, P, or C; L54 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or 
C; A55 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L56 replaced with D, E, H, K, 
R, N, Q, F, W, Y, P, or C; H57 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, 
P, or C; L58 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; S59 replaced with D, E, 

H, K, R, N, Q, F, W, Y, P, or C; A60 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C;

F61 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; G62 replaced with 
D, E, H, K, R, N, Q, F, W, Y, P, or C; K63 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, 
F, W, Y, P, or C; G64 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; F65 replaced
with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; V66 replaced with D, E, H, K, R, 
N, Q, F, W, Y, P, or C; L67 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; R68 
replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; L69 replaced with D, E, 

H, K, R, N, Q, F, W, Y, P, or C; A70 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C;

P71 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; D72 replaced 
with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; D73 replaced with H, K, R, A, 

G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; S74 replaced with D, E, H, K, R, N, Q, F, W, Y, 
P, or C; F75 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; L76 replaced 

with D, E, H, K, R, N, Q, F, W, Y, P, or C; A77 replaced with D, E, H, K, R, N, Q, F, W,

Y, P, or C; P78 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; E79 
replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; F80 replaced with D, 
E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; K81 replaced with D, E, A, G, I, L, S, T, 
M, V, N, Q, F, W, Y, P, or C; I82 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; E83
replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; R84 replaced with D,

E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; L85 replaced with D, E, H, K, R, N, Q, F, 
W, Y, P, or C; G86 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G87 replaced with 

D, E, H, K, R, N, Q, F, W, Y, P, or C; S88 replaced with D, E, H, K, R, N, Q, F, W, Y, P,
or C; G89 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; R90 replaced with D, E, A, 

G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; A91 replaced with D, E, H, K, R, N, Q, F, W,

Y, P, or C; T92 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G93 replaced with D, 

E, H, K, R, N, Q, F, W, Y, P, or C; G94 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or 
C; E95 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; R96 replaced 
with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; G97 replaced with D, E, H, K, R, 

N, Q, F, W, Y, P, or C; L98 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; R99
replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; G100 replaced with D,
E, H, K, R, N, Q, F, W, Y, P, or C; C101 replaced with D, E, H, K, R, A, G, I, L, S, T, M,
V, N, Q, F, W, Y, or P; F102 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, 
or C; F103 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; S104 replaced 

with D, E, H, K, R, N, Q, F, W, Y, P, or C; G105 replaced with D, E, H, K, R, N, Q, F, W,

Y, P, or C; T106 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; V107 replaced with 
D, E, H, K, R, N, Q, F, W, Y, P, or C; N108 replaced with D, E, H, K, R, A, G, I, L, S, T, 
M, V, F, W, Y, P, or C; G109 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; E110
replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; P111 replaced with
D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; E112 replaced with H, K, R, A,
G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; S113 replaced with D, E, H, K, R, N, Q, F, W,
Y, P, or C; L114 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A115 replaced with
D, E, H, K, R, N, Q, F, W, Y, P, or C; A116 replaced with D, E, H, K, R, N, Q, F, W, Y, P,
or C; V117 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; S118 replaced with D, E,
H, K, R, N, Q, F, W, Y, P, or C; L119 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C;
C120 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; R121 

replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; G122 replaced with D,

E, H, K, R, N, Q, F, W, Y, P, or C; L123 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or 
C; S124 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G125 replaced with D, E, H,
K, R, N, Q, F, W, Y, P, or C; S126 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; 
F127 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; L128 replaced with 

D, E, H, K, R, N, Q, F, W, Y, P, or C; L129 replaced with D, E, H, K, R, N, Q, F, W, Y, P,

or C; D130 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; G131 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; E132 replaced with H, K, R, A, G, I,
L, S, T, M, V, N, Q, F, W, Y, P, or C; E133 replaced with H, K, R, A, G, I, L, S, T, M, V,
N, Q, F, W, Y, P, or C; F134 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, 

or C; T135 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; I136 replaced with D, E, H,

K, R, N, Q, F, W, Y, P, or C; Q137 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, 
W, Y, P, or C; P138 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or 
C; Q139 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; G140 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A141 replaced with D, E, H, K, R, N, 

Q, F, W, Y, P, or C; G142 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G143
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; S144 replaced with D, E, H, K, R, N,
Q, F, W, Y, P, or C; L145 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A146 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; Q147 replaced with D, E, H, K, R, A, 
G, I, L, S, T, M, V, F, W, Y, P, or C; P148 replaced with D, E, H, K, R, A, G, I, L, S, T, M, 

V, N, Q, F, W, Y, or C; H149 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P,

or C; R150 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; L151 replaced 
with D, E, H, K, R, N, Q, F, W, Y, P, or C; Q152 replaced with D, E, H, K, R, A, G, I, L, 
S, T, M, V, F, W, Y, P, or C; R153 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W,
Y, P, or C; W154 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; G155 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; P156 replaced with D, E, H, K, R, A, 

G, I, L, S, T, M, V, N, Q, F, W, Y, or C; A157 replaced with D, E, H, K, R, N, Q, F, W, Y, 
P, or C; G158 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A159 replaced with D,

E, H, K, R, N, Q, F, W, Y, P, or C; R160 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, 

F, W, Y, P, or C; P161 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y,
or C; L162 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; P163 replaced with D, E, 

H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; R164 replaced with D, E, A, G, I, L, 
S, T, M, V, N, Q, F, W, Y, P, or C; G165 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or

C; P166 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; E167 
replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; W168 replaced with 

D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; E169 replaced with H, K, R, A, G, I, L, 
S, T, M, V, N, Q, F, W, Y, P, or C; V170 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or 

C; E171 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; T172 replaced

with D, E, H, K, R, N, Q, F, W, Y, P, or C; G173 replaced with D, E, H, K, R, N, Q, F, W, 
Y, P, or C; E174 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; G175 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; Q176 replaced with D, E, H, K, R, A, 

G, I, L, S, T, M, V, F, W, Y, P, or C; R177 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, 
F, W, Y, P, or C; Q178 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or
C; E179 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; R180
replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; G181 replaced with D, 

E, H, K, R, N, Q, F, W, Y, P, or C; D182 replaced with H, K, R, A, G, I, L, S, T, M, V, N, 
Q, F, W, Y, P, or C; H183 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or 

C; Q184 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; E185 replaced

with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; D186 replaced with H, K, R, 
A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; S187 replaced with D, E, H, K, R, N, Q, F, 
W, Y, P, or C; E188 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; 
E189 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; E190 replaced
with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; S191 replaced with D, E, H, K,

R, N, Q, F, W, Y, P, or C; Q192 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, 
Y, P, or C; E193 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; E194 
replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; E195 replaced with
H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; A196 replaced with D, E, H, K, R, 
N, Q, F, W, Y, P, or C; E197 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, 
P, or C; G198 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A199 replaced with D, 
E, H, K, R, N, Q, F, W, Y, P, or C; S200 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or

C; E201 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; P202 replaced 
with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; P203 replaced with D, E, 
H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; P204 replaced with D, E, H, K, R, A, 

G, I, L, S, T, M, V, N, Q, F, W, Y, or C; P205 replaced with D, E, H, K, R, A, G, I, L, S, T, 
M, V, N, Q, F, W, Y, or C; L206 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G207

replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A208 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; T209 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; S210 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; R211 replaced with D, E, A, G, I, L,
S, T, M, V, N, Q, F, W, Y, P, or C; T212 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or 

C; K213 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; R214 replaced

with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; F215 replaced with D, E, H, K, R, 
N, Q, A, G, I, L, S, T, M, V, P, or C; V216 replaced with D, E, H, K, R, N, Q, F, W, Y, P, 
or C; S217 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; E218 replaced with H, K, 
R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; A219 replaced with D, E, H, K, R, N, Q, 

F, W, Y, P, or C; R220 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C;

F221 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; V222 replaced with 
D, E, H, K, R, N, Q, F, W, Y, P, or C; E223 replaced with H, K, R, A, G, I, L, S, T, M, V, 
N, Q, F, W, Y, P, or C; T224 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L225 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L226 replaced with D, E, H, K, R, N, 

Q, F, W, Y, P, or C; V227 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A228

replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; D229 replaced with H, K, R, A, G, I, 
L, S, T, M, V, N, Q, F, W, Y, P, or C; A230 replaced with D, E, H, K, R, N, Q, F, W, Y, P, 
or C; S231 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; M232 replaced with D, E,

H, K, R, N, Q, F, W, Y, P, or C; A233 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; 
A234 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; F235 replaced with D, E, H, K,

R, N, Q, A, G, I, L, S, T, M, V, P, or C; Y236 replaced with D, E, H, K, R, N, Q, A, G, I, 
L, S, T, M, V, P, or C; G237 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A238 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; D239 replaced with H, K, R, A, G, I,
L, S, T, M, V, N, Q, F, W, Y, P, or C; L240 replaced with D, E, H, K, R, N, Q, F, W, Y, P, 
or C; Q241 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; N242 
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; H243 replaced with 
D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; I244 replaced with D, E, H, K, R, N,

Q, F, W, Y, P, or C; L245 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; T246 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L247 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; M248 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; S249 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; V250 replaced with D, E, H, K, R, N, 

Q, F, W, Y, P, or C; A251 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A252
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; R253 replaced with D, E, A, G, I, L,
S, T, M, V, N, Q, F, W, Y, P, or C; I254 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or
C; Y255 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; K256 replaced 
with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; H257 replaced with D, E, A, G, 

I, L, S, T, M, V, N, Q, F, W, Y, P, or C; P258 replaced with D, E, H, K, R, A, G, I, L, S, T,
M, V, N, Q, F, W, Y, or C; S259 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; I260
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; K261 replaced with D, E, A, G, I, L,
S, T, M, V, N, Q, F, W, Y, P, or C; N262 replaced with D, E, H, K, R, A, G, I, L, S, T, M,
V, F, W, Y, P, or C; S263 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; I264 replaced
with D, E, H, K, R, N, Q, F, W, Y, P, or C; N265 replaced with D, E, H, K, R, A, G, I, L,

S, T, M, V, F, W, Y, P, or C; L266 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; 
M267 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; V268 replaced with D, E, H, K, 
R, N, Q, F, W, Y, P, or C; V269 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; K270 
replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; V271 replaced with D,
E, H, K, R, N, Q, F, W, Y, P, or C; L272 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or
C; I273 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; V274 replaced with D, E, H,
K, R, N, Q, F, W, Y, P, or C; E275 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, 
W, Y, P, or C; D276 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; 
E277 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; K278 replaced 

with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; W279 replaced with D, E, H, K,

R, N, Q, A, G, I, L, S, T, M, V, P, or C; G280 replaced with D, E, H, K, R, N, Q, F, W, Y, 
P, or C; P281 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; E282 
replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; V283 replaced with
D, E, H, K, R, N, Q, F, W, Y, P, or C; S284 replaced with D, E, H, K, R, N, Q, F, W, Y, P, 
or C; D285 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; N286 
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; G287 replaced with 
D, E, H, K, R, N, Q, F, W, Y, P, or C; G288 replaced with D, E, H, K, R, N, Q, F, W, Y, P,
or C; L289 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; T290 replaced with D, E,
H, K, R, N, Q, F, W, Y, P, or C; L291 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; 
R292 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; N293 replaced with 
D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; F294 replaced with D, E, H, K, R,
N, Q, A, G, I, L, S, T, M, V, P, or C; C295 replaced with D, E, H, K, R, A, G, I, L, S, T, M,

V, N, Q, F, W, Y, or P; N296 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, 
P, or C; W297 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; Q298 
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; R299 replaced with 
D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; R300 replaced with D, E, A, G, I, L, S, 

T, M, V, N, Q, F, W, Y, P, or C; F301 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T,

M, V, P, or C; N302 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; 
Q303 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; P304 replaced 
with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; S305 replaced with D, E, 
H, K, R, N, Q, F, W, Y, P, or C; D306 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, 

F, W, Y, P, or C; R307 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C;
H308 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; P309 replaced with
D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; E310 replaced with H, K, R, A,
G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; H311 replaced with D, E, A, G, I, L, S, T, M, V,
N, Q, F, W, Y, P, or C; Y312 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P,
or C; D313 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; T314
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A315 replaced with D, E, H, K, R, N,
Q, F, W, Y, P, or C; I316 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L317 replaced
with D, E, H, K, R, N, Q, F, W, Y, P, or C; L318 replaced with D, E, H, K, R, N, Q, F, W, 
Y, P, or C; T319 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; R320 replaced with 

D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; Q321 replaced with D, E, H, K, R, A,

G, I, L, S, T, M, V, F, W, Y, P, or C; N322 replaced with D, E, H, K, R, A, G, I, L, S, T, M, 
V, F, W, Y, P, or C; F323 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or 
C; C324 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; G325
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; Q326 replaced with D, E, H, K, R, A, 
G, I, L, S, T, M, V, F, W, Y, P, or C; E327 replaced with H, K, R, A, G, I, L, S, T, M, V, N, 
Q, F, W, Y, P, or C; G328 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L329 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; C330 replaced with D, E, H, K, R, A,

G, I, L, S, T, M, V, N, Q, F, W, Y, or P; D331 replaced with H, K, R, A, G, I, L, S, T, M, 
V, N, Q, F, W, Y, P, or C; T332 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L333 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G334 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; V335 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A336 

replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; D337 replaced with H, K, R, A, G, I,
L, S, T, M, V, N, Q, F, W, Y, P, or C; I338 replaced with D, E, H, K, R, N, Q, F, W, Y, P,
or C; G339 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; T340 replaced with D, E,
H, K, R, N, Q, F, W, Y, P, or C; I341 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C;
C342 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; D343 

replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; P344 replaced with

D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; N345 replaced with D, E, H, K, 
R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; K346 replaced with D, E, A, G, I, L, S, T, M, V, 
N, Q, F, W, Y, P, or C; S347 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; C348 
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; S349 replaced with 

D, E, H, K, R, N, Q, F, W, Y, P, or C; V350 replaced with D, E, H, K, R, N, Q, F, W, Y, P,
or C; I351 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; E352 replaced with H, K,
R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; D353 replaced with H, K, R, A, G, I, L, 
S, T, M, V, N, Q, F, W, Y, P, or C; E354 replaced with H, K, R, A, G, I, L, S, T, M, V, N, 
Q, F, W, Y, P, or C; G355 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L356 

replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; Q357 replaced with D, E, H, K, R, A,

G, I, L, S, T, M, V, F, W, Y, P, or C; A358 replaced with D, E, H, K, R, N, Q, F, W, Y, P, 
or C; A359 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; H360 replaced with D, E, 
A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; T361 replaced with D, E, H, K, R, N, Q, F, 
W, Y, P, or C; L362 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A363 replaced 

with D, E, H, K, R, N, Q, F, W, Y, P, or C; H364 replaced with D, E, A, G, I, L, S, T, M,

V, N, Q, F, W, Y, P, or C; E365 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, 
Y, P, or C; L366 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G367 replaced with 
D, E, H, K, R, N, Q, F, W, Y, P, or C; H368 replaced with D, E, A, G, I, L, S, T, M, V, N,
Q, F, W, Y, P, or C; V369 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L370 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; S371 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; M372 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; P373 
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; H374 replaced with

D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; D375 replaced with H, K, R, A, G, I, 
L, S, T, M, V, N, Q, F, W, Y, P, or C; D376 replaced with H, K, R, A, G, I, L, S, T, M, V, 
N, Q, F, W, Y, P, or C; S377 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; K378 
replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; P379 replaced with D, E, 

H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; C380 replaced with D, E, H, K, R, A,

G, I, L, S, T, M, V, N, Q, F, W, Y, or P; T381 replaced with D, E, H, K, R, N, Q, F, W, Y, 
P, or C; R382 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; L383 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; F384 replaced with D, E, H, K, R, N, 
Q, A, G, I, L, S, T, M, V, P, or C; G385 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or 

C; P386 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; M387

replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G388 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; K389 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or 
C; H390 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; H391 replaced 
with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; V392 replaced with D, E, H, K, 

R, N, Q, F, W, Y, P, or C; M393 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A394

replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; P395 replaced with D, E, H, K, R, A, 

G, I, L, S, T, M, V, N, Q, F, W, Y, or C; L396 replaced with D, E, H, K, R, N, Q, F, W, Y, 
P, or C; F397 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; V398 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; H399 replaced with D, E, A, G, I, L, 

S, T, M, V, N, Q, F, W, Y, P, or C; L400 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or

C; N401 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; Q402 
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; T403 replaced with D, 

E, H, K, R, N, Q, F, W, Y, P, or C; L404 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or 
C; P405 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; W406 

replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; S407 replaced with D, E,

H, K, R, N, Q, F, W, Y, P, or C; P408 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, 
N, Q, F, W, Y, or C; C409 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, 
Y, or P; S410 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A411 replaced with D,

E, H, K, R, N, Q, F, W, Y, P, or C; M412 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or 
C; Y413 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; L414 replaced 
with D, E, H, K, R, N, Q, F, W, Y, P, or C; T415 replaced with D, E, H, K, R, N, Q, F, W, 

Y, P, or C; E416 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; L417

replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L418 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; D419 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, 
or C; G420 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G421 replaced with D, E, 
H, K, R, N, Q, F, W, Y, P, or C; H422 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, 
W, Y, P, or C; G423 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; D424 replaced

with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; C425 replaced with D, E, H, 
K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; L426 replaced with D, E, H, K, R, N, Q, 

F, W, Y, P, or C; L427 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; D428 replaced 
with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; A429 replaced with D, E, H, 

K, R, N, Q, F, W, Y, P, or C; P430 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N,

Q, F, W, Y, or C; G431 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A432 replaced
with D, E, H, K, R, N, Q, F, W, Y, P, or C; A433 replaced with D, E, H, K, R, N, Q, F, W, 
Y, P, or C; L434 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; P435 replaced with 

D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; L436 replaced with D, E, H, K, 
R, N, Q, F, W, Y, P, or C; P437 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q,

F, W, Y, or C; T438 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G439 replaced 
with D, E, H, K, R, N, Q, F, W, Y, P, or C; L440 replaced with D, E, H, K, R, N, Q, F, W, 
Y, P, or C; P441 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; 
G442 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; R443 replaced with D, E, A, G, 
I, L, S, T, M, V, N, Q, F, W, Y, P, or C; M444 replaced with D, E, H, K, R, N, Q, F, W, Y,

P, or C; A445 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L446 replaced with D, 

E, H, K, R, N, Q, F, W, Y, P, or C; Y447 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, 
T, M, V, P, or C; Q448 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or 
C; L449 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; D450 replaced with H, K, R, 

A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; Q451 replaced with D, E, H, K, R, A, G, I,

L, S, T, M, V, F, W, Y, P, or C; Q452 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, 

F, W, Y, P, or C; C453 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, 
or P; R454 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; Q455 replaced
with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; I456 replaced with D, E, H, K,
R, N, Q, F, W, Y, P, or C; F457 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, 
P, or C; G458 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; P459 replaced with D, 
E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; D460 replaced with H, K, R, A, G,

I, L, S, T, M, V, N, Q, F, W, Y, P, or C; F461 replaced with D, E, H, K, R, N, Q, A, G, I, L, 
S, T, M, V, P, or C; R462 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or
C; H463 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; C464 replaced 
with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; P465 replaced with D, E, H, 

K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; N466 replaced with D, E, H, K, R, A, G,

I, L, S, T, M, V, F, W, Y, P, or C; T467 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or 
C; S468 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A469 replaced with D, E, H, 
K, R, N, Q, F, W, Y, P, or C; Q470 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, 
W, Y, P, or C; D471 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C;

V472 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; C473 replaced with D, E, H, K,

R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; A474 replaced with D, E, H, K, R, N, Q, F, 
W, Y, P, or C; Q475 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; 
L476 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; W477 replaced with D, E, H, K, 
R, N, Q, A, G, I, L, S, T, M, V, P, or C; C478 replaced with D, E, H, K, R, A, G, I, L, S, T, 

M, V, N, Q, F, W, Y, or P; H479 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y,

P, or C; T480 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; D481 replaced with H,
K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; G482 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; A483 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; E484 
replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; P485 replaced with

D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; L486 replaced with D, E, H, K,

R, N, Q, F, W, Y, P, or C; C487 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, 
F, W, Y, or P; H488 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; T489 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; K490 replaced with D, E, A, G, I, L, 
S, T, M, V, N, Q, F, W, Y, P, or C; N491 replaced with D, E, H, K, R, A, G, I, L, S, T, M, 

V, F, W, Y, P, or C; G492 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; S493

replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L494 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; P495 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, 
Y, or C; W496 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; A497
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; D498 replaced with H, K, R, A, G, I, 
L, S, T, M, V, N, Q, F, W, Y, P, or C; G499 replaced with D, E, H, K, R, N, Q, F, W, Y, P, 
or C; T500 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; P501 replaced with D, E, 
H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; C502 replaced with D, E, H, K, R, A,

G, I, L, S, T, M, V, N, Q, F, W, Y, or P; G503 replaced with D, E, H, K, R, N, Q, F, W, Y, 
P, or C; P504 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; G505 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; H506 replaced with D, E, A, G, I, L, 
S, T, M, V, N, Q, F, W, Y, P, or C; L507 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or 

C; C508 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; S509

replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; E510 replaced with H, K, R, A, G, I, 
L, S, T, M, V, N, Q, F, W, Y, P, or C; G511 replaced with D, E, H, K, R, N, Q, F, W, Y, P,
or C; S512 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; C513 replaced with D, E, 

H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; L514 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; P515 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W,

Y, or C; E516 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; E517
replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; E518 replaced with 
H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; V519 replaced with D, E, H, K, R, 
N, Q, F, W, Y, P, or C; E520 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, 
P, or C; R521 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; P522

replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; K523 replaced with 

D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; P524 replaced with D, E, H, K, R, A,

G, I, L, S, T, M, V, N, Q, F, W, Y, or C; V525 replaced with D, E, H, K, R, N, Q, F, W, Y, 
P, or C; V526 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; D527 replaced with H, 

K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; G528 replaced with D, E, H, K, R, N,

Q, F, W, Y, P, or C; G529 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; W530 
replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; A531 replaced with D,

E, H, K, R, N, Q, F, W, Y, P, or C; P532 replaced with D, E, H, K, R, A, G, I, L, S, T, M, 
V, N, Q, F, W, Y, or C; W533 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, 

or C; G534 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; P535 replaced with D, E,

H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; W536 replaced with D, E, H, K, R, N, 
Q, A, G, I, L, S, T, M, V, P, or C; G537 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or 
C; E538 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; C539
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; S540 replaced with 
D, E, H, K, R, N, Q, F, W, Y, P, or C; R541 replaced with D, E, A, G, I, L, S, T, M, V, N, 
Q, F, W, Y, P, or C; T542 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; C543
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; G544 replaced with

D, E, H, K, R, N, Q, F, W, Y, P, or C; G545 replaced with D, E, H, K, R, N, Q, F, W, Y, P, 
or C; G546 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; V547 replaced with D, E, 

H, K, R, N, Q, F, W, Y, P, or C; Q548 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, 

F, W, Y, P, or C; F549 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; 
S550 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; H551 replaced with D, E, A, G,

I, L, S, T, M, V, N, Q, F, W, Y, P, or C; R552 replaced with D, E, A, G, I, L, S, T, M, V, N, 
Q, F, W, Y, P, or C; E553 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, 
or C; C554 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; K555 
replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; D556 replaced with H, 

K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; P557 replaced with D, E, H, K, R, A,

G, I, L, S, T, M, V, N, Q, F, W, Y, or C; E558 replaced with H, K, R, A, G, I, L, S, T, M, 
V, N, Q, F, W, Y, P, or C; P559 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, 
F, W, Y, or C; Q560 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; 
N561 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; G562 replaced 

with D, E, H, K, R, N, Q, F, W, Y, P, or C; G563 replaced with D, E, H, K, R, N, Q, F, W,

Y, P, or C; R564 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; Y565 
replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; C566 replaced with D, E, 

H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; L567 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; G568 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; R569 

replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; R570 replaced with D,

E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; A571 replaced with D, E, H, K, R, N, Q,

F, W, Y, P, or C; K572 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; 
Y573 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; Q574 replaced with 
D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; S575 replaced with D, E, H, K, R, 

N, Q, F, W, Y, P, or C; C576 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F,

W, Y, or P; H577 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; T578 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; E579 replaced with H, K, R, A, G, I, 
L, S, T, M, V, N, Q, F, W, Y, P, or C; E580 replaced with H, K, R, A, G, I, L, S, T, M, V,
N, Q, F, W, Y, P, or C; C581 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, 
W, Y, or P; P582 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; 
P583 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; D584 
replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; G585 replaced with

D, E, H, K, R, N, Q, F, W, Y, P, or C; K586 replaced with D, E, A, G, I, L, S, T, M, V, N, 
Q, F, W, Y, P, or C; S587 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; F588 
replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; R589 replaced with D, E, 
A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; E590 replaced with H, K, R, A, G, I, L, S, T, 

M, V, N, Q, F, W, Y, P, or C; Q591 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F,

W, Y, P, or C; Q592 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; 
C593 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; E594 
replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; K595 replaced with 
D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; Y596 replaced with D, E, H, K, R, N, 

Q, A, G, I, L, S, T, M, V, P, or C; N597 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V,

F, W, Y, P, or C; A598 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; Y599 replaced 
with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; N600 replaced with D, E, H, K, R, 
A, G, I, L, S, T, M, V, F, W, Y, P, or C; Y601 replaced with D, E, H, K, R, N, Q, A, G, I, 
L, S, T, M, V, P, or C; T602 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; D603 

replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; M604 replaced with

D, E, H, K, R, N, Q, F, W, Y, P, or C; D605 replaced with H, K, R, A, G, I, L, S, T, M, V, 
N, Q, F, W, Y, P, or C; G606 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; N607 
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; L608 replaced with D, 

E, H, K, R, N, Q, F, W, Y, P, or C; L609 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or
C; Q610 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; W611

replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; V612 replaced with D, 
E, H, K, R, N, Q, F, W, Y, P, or C; P613 replaced with D, E, H, K, R, A, G, I, L, S, T, M, 
V, N, Q, F, W, Y, or C; K614 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, 
or C; Y615 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; A616 replaced 
with D, E, H, K, R, N, Q, F, W, Y, P, or C; G617 replaced with D, E, H, K, R, N, Q, F, W,

Y, P, or C; V618 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; S619 replaced with 
D, E, H, K, R, N, Q, F, W, Y, P, or C; P620 replaced with D, E, H, K, R, A, G, I, L, S, T, 
M, V, N, Q, F, W, Y, or C; R621 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y,
P, or C; D622 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; R623 
replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; C624 replaced with D, 
E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; K625 replaced with D, E, A, G, I,
L, S, T, M, V, N, Q, F, W, Y, P, or C; L626 replaced with D, E, H, K, R, N, Q, F, W, Y, P,

or C; F627 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; C628 replaced 
with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; R629 replaced with D, E, 
A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; A630 replaced with D, E, H, K, R, N, Q, F,
W, Y, P, or C; R631 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; 
G632 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; R633 replaced with D, E, A, G,

I, L, S, T, M, V, N, Q, F, W, Y, P, or C; S634 replaced with D, E, H, K, R, N, Q, F, W, Y, 
P, or C; E635 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; F636 
replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; K637 replaced with D,

E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; V638 replaced with D, E, H, K, R, N, Q, 
F, W, Y, P, or C; F639 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C;

E640 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; A641 replaced 
with D, E, H, K, R, N, Q, F, W, Y, P, or C; K642 replaced with D, E, A, G, I, L, S, T, M, 
V, N, Q, F, W, Y, P, or C; V643 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; I644
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; D645 replaced with H, K, R, A, G, I, 

L, S, T, M, V, N, Q, F, W, Y, P, or C; G646 replaced with D, E, H, K, R, N, Q, F, W, Y, P,

or C; T647 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L648 replaced with D, E, 
H, K, R, N, Q, F, W, Y, P, or C; C649 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, 
N, Q, F, W, Y, or P; G650 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; P651 
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; E652 replaced with 

H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; T653 replaced with D, E, H, K, R,

N, Q, F, W, Y, P, or C; L654 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A655 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; I656 replaced with D, E, H, K, R, N,
Q, F, W, Y, P, or C; C657 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, 
Y, or P; V658 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; R659 replaced with D, 

E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; G660 replaced with D, E, H, K, R, N, Q,

F, W, Y, P, or C; Q661 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or 
C; C662 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; V663 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; K664 replaced with D, E, A, G, I, L,
S, T, M, V, N, Q, F, W, Y, P, or C; A665 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or 
C; G666 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; C667 replaced with D, E, H,
K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; D668 replaced with H, K, R, A, G, I, L, 
S, T, M, V, N, Q, F, W, Y, P, or C; H669 replaced with D, E, A, G, I, L, S, T, M, V, N, Q,

F, W, Y, P, or C; V670 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; V671 replaced
with D, E, H, K, R, N, Q, F, W, Y, P, or C; D672 replaced with H, K, R, A, G, I, L, S, T, M, 
V, N, Q, F, W, Y, P, or C; S673 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; P674 
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; R675 replaced with 

D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; K676 replaced with D, E, A, G, I, L, S,

T, M, V, N, Q, F, W, Y, P, or C; L677 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; 
D678 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; K679 replaced 
with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; C680 replaced with D, E, H, K, 
R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; G681 replaced with D, E, H, K, R, N, Q, F, 

W, Y, P, or C; V682 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; C683 replaced

with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; G684 replaced with D, E, 
H, K, R, N, Q, F, W, Y, P, or C; G685 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; 
K686 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; G687 replaced with 
D, E, H, K, R, N, Q, F, W, Y, P, or C; N688 replaced with D, E, H, K, R, A, G, I, L, S, T, 

M, V, F, W, Y, P, or C; S689 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; C690

replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; R691 replaced with 

D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; K692 replaced with D, E, A, G, I, L, S, 
T, M, V, N, Q, F, W, Y, P, or C; V693 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; 
S694 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G695 replaced with D, E, H, K, 

R, N, Q, F, W, Y, P, or C; S696 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L697

replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; T698 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; P699 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, 
Y, or C; T700 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; N701 replaced with D, 

E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; Y702 replaced with D, E, H, K, R, N, 
Q, A, G, I, L, S, T, M, V, P, or C; G703 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or

C; Y704 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; N705 replaced 
with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; D706 replaced with H, K, R, 
A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; I707 replaced with D, E, H, K, R, N, Q, F, W,
Y, P, or C; V708 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; T709 replaced with 
D, E, H, K, R, N, Q, F, W, Y, P, or C; I710 replaced with D, E, H, K, R, N, Q, F, W, Y, P,
or C; P711 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; A712 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G713 replaced with D, E, H, K, R, N,

Q, F, W, Y, P, or C; A714 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; T715 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; N716 replaced with D, E, H, K, R, A, 
G, I, L, S, T, M, V, F, W, Y, P, or C; I717 replaced with D, E, H, K, R, N, Q, F, W, Y, P,
or C; D718 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; V719 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; K720 replaced with D, E, A, G, I, L,

S, T, M, V, N, Q, F, W, Y, P, or C; Q721 replaced with D, E, H, K, R, A, G, I, L, S, T, M, 
V, F, W, Y, P, or C; R722 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or 
C; S723 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; H724 replaced with D, E, A, 

G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; P725 replaced with D, E, H, K, R, A, G, I, L, S, 
T, M, V, N, Q, F, W, Y, or C; G726 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C;

V727 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; Q728 replaced with D, E, H, K, 
R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; N729 replaced with D, E, H, K, R, A, G, I, L, S, 
T, M, V, F, W, Y, P, or C; D730 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, 
Y, P, or C; G731 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; N732 replaced with 
D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; Y733 replaced with D, E, H, K, R,

N, Q, A, G, I, L, S, T, M, V, P, or C; L734 replaced with D, E, H, K, R, N, Q, F, W, Y, P, 
or C; A735 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L736 replaced with D, E, 

H, K, R, N, Q, F, W, Y, P, or C; K737 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, 
W, Y, P, or C; T738 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A739 replaced 

with D, E, H, K, R, N, Q, F, W, Y, P, or C; D740 replaced with H, K, R, A, G, I, L, S, T, M,

V, N, Q, F, W, Y, P, or C; G741 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; Q742 
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; Y743 replaced with 
D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; L744 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; L745 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; N746 

replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; G747 replaced with

D, E, H, K, R, N, Q, F, W, Y, P, or C; N748 replaced with D, E, H, K, R, A, G, I, L, S, T, 
M, V, F, W, Y, P, or C; L749 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A750 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; I751 replaced with D, E, H, K, R, N,
Q, F, W, Y, P, or C; S752 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A753 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; I754 replaced with D, E, H, K, R, N,
Q, F, W, Y, P, or C; E755 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, 
or C; Q756 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; D757

replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; I758 replaced with D,
E, H, K, R, N, Q, F, W, Y, P, or C; L759 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or 
C; V760 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; K761 replaced with D, E, A, 
G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; G762 replaced with D, E, H, K, R, N, Q, F, W, 

Y, P, or C; T763 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; I764 replaced with D,

E, H, K, R, N, Q, F, W, Y, P, or C; L765 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or 
C; K766 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; Y767 replaced 
with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; S768 replaced with D, E, H, K, R, 
N, Q, F, W, Y, P, or C; G769 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; S770 

replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; I771 replaced with D, E, H, K, R, N,

Q, F, W, Y, P, or C; A772 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; T773 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L774 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; E775 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, 
or C; R776 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; L777 replaced 

with D, E, H, K, R, N, Q, F, W, Y, P, or C; Q778 replaced with D, E, H, K, R, A, G, I, L,

S, T, M, V, F, W, Y, P, or C; S779 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; 
F780 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; R781 replaced with 
D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; P782 replaced with D, E, H, K, R, A, 

G, I, L, S, T, M, V, N, Q, F, W, Y, or C; L783 replaced with D, E, H, K, R, N, Q, F, W, Y, 
P, or C; P784 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; E785

replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; P786 replaced with 
D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; L787 replaced with D, E, H, K, 
R, N, Q, F, W, Y, P, or C; T788 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; V789 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; Q790 replaced with D, E, H, K, R, A, 
G, I, L, S, T, M, V, F, W, Y, P, or C; L791 replaced with D, E, H, K, R, N, Q, F, W, Y, P,

or C; L792 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; T793 replaced with D, E, 

H, K, R, N, Q, F, W, Y, P, or C; V794 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; 
P795 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; G796
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; E797 replaced with H, K, R, A, G, I, 
L, S, T, M, V, N, Q, F, W, Y, P, or C; V798 replaced with D, E, H, K, R, N, Q, F, W, Y, P, 
or C; F799 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; P800 replaced 
with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; P801 replaced with D, E,

H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; K802 replaced with D, E, A, G, I, L, 
S, T, M, V, N, Q, F, W, Y, P, or C; V803 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or 
C; K804 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; Y805 replaced 
with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; T806 replaced with D, E, H, K, R, 

N, Q, F, W, Y, P, or C; F807 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P,

or C; F808 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; V809 replaced 
with D, E, H, K, R, N, Q, F, W, Y, P, or C; P810 replaced with D, E, H, K, R, A, G, I, L, S, 
T, M, V, N, Q, F, W, Y, or C; N811 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F,
W, Y, P, or C; D812 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; 

V813 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; D814 replaced with H, K, R, A,

G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; F815 replaced with D, E, H, K, R, N, Q, A, G, 

I, L, S, T, M, V, P, or C; S816 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; M817 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; Q818 replaced with D, E, H, K, R, A, 
G, I, L, S, T, M, V, F, W, Y, P, or C; S819 replaced with D, E, H, K, R, N, Q, F, W, Y, P, 

or C; S820 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; K821 replaced with D, E,

A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; E822 replaced with H, K, R, A, G, I, L, S, T, 
M, V, N, Q, F, W, Y, P, or C; R823 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, 
Y, P, or C; A824 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; T825 replaced with 
D, E, H, K, R, N, Q, F, W, Y, P, or C; T826 replaced with D, E, H, K, R, N, Q, F, W, Y, P, 

or C; N827 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; I828

replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; I829 replaced with D, E, H, K, R, N,
Q, F, W, Y, P, or C; Q830 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, 
or C; P831 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; L832 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L833 replaced with D, E, H, K, R, N, 

Q, F, W, Y, P, or C; H834 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or

C; A835 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; Q836 replaced with D, E, H,
K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; W837 replaced with D, E, H, K, R, N, Q, A, 
G, I, L, S, T, M, V, P, or C; V838 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L839
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; G840 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; D841 replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, 
or C; W842 replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; S843 replaced 
with D, E, H, K, R, N, Q, F, W, Y, P, or C; E844 replaced with H, K, R, A, G, I, L, S, T, M, 
V, N, Q, F, W, Y, P, or C; C845 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, 
F, W, Y, or P; S846 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; S847 replaced with 

D, E, H, K, R, N, Q, F, W, Y, P, or C; T848 replaced with D, E, H, K, R, N, Q, F, W, Y, P, 
or C; C849 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; G850 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; A851 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; G852 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; W853 
replaced with D, E, H, K, R, N, Q, A, G, I, L, S, T, M, V, P, or C; Q854 replaced with D, 

E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; R855 replaced with D, E, A, G, I, L, S, 
T, M, V, N, Q, F, W, Y, P, or C; R856 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, 
W, Y, P, or C; T857 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; V858 replaced 
with D, E, H, K, R, N, Q, F, W, Y, P, or C; E859 replaced with H, K, R, A, G, I, L, S, T, M, 
V, N, Q, F, W, Y, P, or C; C860 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, 

F, W, Y, or P; R861 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; D862
replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; P863 replaced with 
D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; S864 replaced with D, E, H, K, 
R, N, Q, F, W, Y, P, or C; G865 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; Q866 
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, or C; A867 replaced with 
D, E, H, K, R, N, Q, F, W, Y, P, or C; S868 replaced with D, E, H, K, R, N, Q, F, W, Y, P, 
or C; A869 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; T870 replaced with D, E, 

H, K, R, N, Q, F, W, Y, P, or C; C871 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, 
N, Q, F, W, Y, or P; N872 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, P, 
or C; K873 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; A874 
replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; L875 replaced with D, E, H, K, R, N, 
Q, F, W, Y, P, or C; K876 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or 
C; P877 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; E878 
replaced with H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; D879 replaced with 
H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; A880 replaced with D, E, H, K, R, 






N, Q, F, W, Y, P, or C; K881 replaced with D, E, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, 
or C; P882 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; C883 
replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; E884 replaced with 
H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, P, or C; S885 replaced with D, E, H, K, R, 
N, Q, F, W, Y, P, or C; Q886 replaced with D, E, H, K, R, A, G, I, L, S, T, M, V, F, W, Y, 
P, or C; L887 replaced with D, E, H, K, R, N, Q, F, W, Y, P, or C; C888 replaced with D, 
E, H, K, R, A, G, I, L, S, T, M, V, N, Q, F, W, Y, or P; P889 replaced with D, E, H, K, R, 
A, G, I, L, S, T, M, V, N, Q, F, W, Y, or C; L890 replaced with D, E, H, K, R, N, Q, F, W, 
Y, P, or 

METH1 or METH2 polypeptides may contain 50, 40, 30, 10, 20, 19, 18, 17, 16, 15, 
14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 conservative or non-conservative amino acid 
substitutions. Additionally, METH1 or METH2 polypeptides may contain both conservative 
and non-conservative substitutions, in any combination. A METH1 or METH2 polypeptide 
may contain 50, 40, 30, 10, 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2 or 
1 conservative amino acid substitutions, and 50, 40, 30, 10, 20, 19, 18, 17, 16, 15, 14, 13, 
12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 non-conservative amino acid substitutions in the same 
polypeptide. For example, a particular polypeptide may contain 10 conservative amino acid 
substitutions and 10 non-conservative amino acid substitutions. Polynucleotides encoding 
such METH1 or METH2 polypeptides with substitutions are also encompassed within the 
present invention. 

The substitutions may be made in full-length METH1 or METH2, mature METH1 
or METH2, and any other METH1 or METH2 variant disclosed herein, including METH1 
or METH2 polypeptides with N- and/or C-terminal amino acid deletions; METH1 or 
METH2 polypeptides which lack one or more domains; or hybrid METH1/METH2 
molecules. 

Amino acids in the METH1 and METH2 proteins of the present invention that are 
essential for function can be identified by methods known in the art, such as site-directed 
mutagenesis or alanine-scanning mutagenesis (Cunningham and Wells, Science 244:1081-1085 
(1989)). The latter procedure introduces single alanine mutations at every residue in 
the molecule. The resulting mutant molecules are then tested for biological activity such as 
in vitro or in vivo inhibition of angiogenesis. Sites that are critical for inhibition of 
angiogenesis can also be determined by structural analysis such as crystallization, nuclear 
magnetic resonance or photoaffinity labeling (Smith et al., J. Mol. Biol. 224:899-904 (1992) 
and de Vos et al., Science 255:306-312 (1992)). 
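As an illustrative sketch only, and not part of the disclosed methods, the set of single-alanine variants that an alanine scan would cover can be enumerated in Python. The function name `alanine_scan` and the four-residue peptide are hypothetical placeholders rather than sequences from SEQ ID NO:2 or SEQ ID NO:4, and skipping residues that are already alanine is an assumption made here for convenience.

```python
def alanine_scan(sequence):
    """Return (label, mutant_sequence) pairs, one per non-alanine residue.

    Labels use the common <wild type><1-based position>A notation.
    Positions that are already alanine are skipped (an assumption).
    """
    variants = []
    for index, residue in enumerate(sequence):
        if residue == "A":
            continue
        mutant = sequence[:index] + "A" + sequence[index + 1:]
        variants.append((f"{residue}{index + 1}A", mutant))
    return variants


if __name__ == "__main__":
    # Hypothetical toy peptide, used only to show the shape of the output.
    for label, mutant in alanine_scan("MKAL"):
        print(label, mutant)
```

Each variant produced this way would then be expressed and assayed separately; the code only lists the constructs and says nothing about their activity.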

Particularly preferred are polypeptides with amino acid substitutions at the 
boundaries of each domain (for example, at the boundary of the metalloprotease domain). 
Amino acid substitutions at these boundaries may be made to change the activity of the 
protein, for example, to prevent cleavage. Amino acid substitutions may also be made 
which do not affect the activity of the protein. For example, the following amino acids may 
be replaced in METH1 with the following amino acids: L-19 may be replaced with A, C, 
D, E, F, G, H, I, K, M, N, P, Q, R, S, T, V, W or Y; L-20 may be replaced with A, C, D, E, 
F, G, H, I, K, M, N, P, Q, R, S, T, V, W or Y; L-21 may be replaced with A, C, D, E, F, G, 
H, I, K, M, N, P, Q, R, S, T, V, W or Y; L-22 may be replaced with A, C, D, E, F, G, H, I, 
K, M, N, P, Q, R, S, T, V, W or Y; A-23 may be replaced with C, D, 
E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; A-24 may be replaced with C, D, E, F, 
G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; A-25 may be replaced with C, D, E, F, G, H, 
I, K, L, M, N, P, Q, R, S, T, V, W or Y; L-26 may be replaced with A, C, D, E, F, G, H, I, 
K, M, N, P, Q, R, S, T, V, W or Y; L-27 may be replaced with A, C, D, E, F, G, H, I, K, M, 
N, P, Q, R, S, T, V, W or Y; A-28 may be replaced with C, D, E, F, G, H, I, K, L, M, N, P, 
Q, R, S, T, V, W or Y; V-29 may be replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, 
R, S, T, W or Y; S-30 may be replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, T, 
V, W or Y; D-31 may be replaced with A, C, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W 
or Y; A-32 may be replaced with C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; 
L-33 may be replaced with A, C, D, E, F, G, H, I, K, M, N, P, Q, R, S, T, V, W or Y; G-34 
may be replaced with A, C, D, E, F, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; R-35 may 
be replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, S, T, V, W or Y; P-36 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, Q, R, S, T, V, W or Y; S-37 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, T, V, W or Y; E-38 may be 
replaced with A, C, D, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; E-39 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; P-225 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, Q, R, S, T, V, W or Y; T-226 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, V, W or Y; G-227 may be 
replaced with A, C, D, E, F, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; T-228 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, V, W or Y; G-229 may be 
replaced with A, C, D, E, F, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; S-230 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, T, V, W or Y; I-231 may be 
replaced with A, C, D, E, F, G, H, K, L, M, N, P, Q, R, S, T, V, W or Y; R-232 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, S, T, V, W or Y; K-233 may be 
replaced with A, C, D, E, F, G, H, I, L, M, N, P, Q, R, S, T, V, W or Y; K-234 may be 
replaced with A, C, D, E, F, G, H, I, L, M, N, P, Q, R, S, T, V, W or Y; R-235 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, S, T, V, W or Y; F-236 may be 
replaced with A, C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; V-237 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, W or Y; S-238 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, T, V, W or Y; S-239 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, T, V, W or Y; H-240 may be 
replaced with A, C, D, E, F, G, I, K, L, M, N, P, Q, R, S, T, V, W or Y; R-241 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, S, T, V, W or Y; Y-242 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, or W; V-243 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, W or Y; E-244 may be 
replaced with A, C, D, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; T-245 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, V, W or Y; K-449 may be 
replaced with A, C, D, E, F, G, H, I, L, M, N, P, Q, R, S, T, V, W or Y; P-450 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, Q, R, S, T, V, W or Y; Q-451 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, R, S, T, V, W or Y; N-452 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, P, Q, R, S, T, V, W or Y; P-453 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, Q, R, S, T, V, W or Y; I-454 may be 
replaced with A, C, D, E, F, G, H, K, L, M, N, P, Q, R, S, T, V, W or Y; Q-455 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, R, S, T, V, W or Y; L-456 may be 
replaced with A, C, D, E, F, G, H, I, K, M, N, P, Q, R, S, T, V, W or Y; P-457 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, Q, R, S, T, V, W or Y; G-458 may be 
replaced with A, C, D, E, F, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; D-459 may be 
replaced with A, C, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; L-460 may be 
replaced with A, C, D, E, F, G, H, I, K, M, N, P, Q, R, S, T, V, W or Y; P-461 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, Q, R, S, T, V, W or Y; G-462 may be 
replaced with A, C, D, E, F, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; T-463 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, V, W or Y; S-464 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, T, V, W or Y; Y-465 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, or W; D-466 may be 
replaced with A, C, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; A-467 may be 
replaced with C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; N-468 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, P, Q, R, S, T, V, W or Y; R-469 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, S, T, V, W or Y; R-534 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, S, T, V, W or Y; K-535 may be 
replaced with A, C, D, E, F, G, H, I, L, M, N, P, Q, R, S, T, V, W or Y; H-536 may be 
replaced with A, C, D, E, F, G, I, K, L, M, N, P, Q, R, S, T, V, W or Y; F-537 may be 
replaced with A, C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; D-538 may be 
replaced with A, C, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; T-539 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, V, W or Y; P-540 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, Q, R, S, T, V, W or Y; F-541 may be 
replaced with A, C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; H-542 may be 
replaced with A, C, D, E, F, G, I, K, L, M, N, P, Q, R, S, T, V, W or Y; G-543 may be 
replaced with A, C, D, E, F, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; S-544 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, T, V, W or Y; W-545 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, or Y; G-546 may be 
replaced with A, C, D, E, F, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; M-547 may be 
replaced with A, C, D, E, F, G, H, I, K, L, N, P, Q, R, S, T, V, W or Y; W-548 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, or Y; G-549 may be 
replaced with A, C, D, E, F, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; P-550 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, Q, R, S, T, V, W or Y; W-551 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, or Y; G-552 may be 
replaced with A, C, D, E, F, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; D-553 may be 
replaced with A, C, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; G-554 may be 
replaced with A, C, D, E, F, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; S-831 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, T, V, W or Y; F-832 may be 
replaced with A, C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; N-833 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, P, Q, R, S, T, V, W or Y; A-834 may be 
replaced with C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; I-835 may be 
replaced with A, C, D, E, F, G, H, K, L, M, N, P, Q, R, S, T, V, W or Y; P-836 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, Q, R, S, T, V, W or Y; T-837 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, V, W or Y; F-838 may be 
replaced with A, C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; S-839 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, T, V, W or Y; A-840 may be 
replaced with C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; W-841 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V or Y; V-842 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, W or Y; I-843 may be 
replaced with A, C, D, E, F, G, H, K, L, M, N, P, Q, R, S, T, V, W or Y; E-844 may be 
replaced with A, C, D, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; E-845 may be 
replaced with A, C, D, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; W-846 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, or Y; G-847 may be 
replaced with A, C, D, E, F, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; E-848 may be 
replaced with A, C, D, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; C-849 may be 
replaced with A, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; S-850 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, T, V, W or Y; K-851 may be 
replaced with A, C, D, E, F, G, H, I, L, M, N, P, Q, R, S, T, V, W or Y; R-885 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, S, T, V, W or Y; P-886 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, Q, R, S, T, V, W or Y; C-887 may be 
replaced with A, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; A-888 may be 
replaced with C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; D-889 may be 
replaced with A, C, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; H-890 may be 
replaced with A, C, D, E, F, G, I, K, L, M, N, P, Q, R, S, T, V, W or Y; P-891 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, Q, R, S, T, V, W or Y; C-892 may be 
replaced with A, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; P-893 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, Q, R, S, T, V, W or Y; Q-894 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, R, S, T, V, W or Y; W-895 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, or Y; Q-896 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, R, S, T, V, W or Y; L-897 may be 
replaced with A, C, D, E, F, G, H, I, K, M, N, P, Q, R, S, T, V, W or Y; G-898 may be 
replaced with A, C, D, E, F, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; E-899 may be 
replaced with A, C, D, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; W-900 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, or Y; S-901 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, T, V, W or Y; S-902 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, T, V, W or Y; C-903 may be 
replaced with A, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; S-904 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, T, V, W or Y; and/or K-905 may 
be replaced with A, C, D, E, F, G, H, I, L, M, N, P, Q, R, S, T, V, W or Y. 

In addition, the following amino acids may be replaced in METH2 with the 
following amino acids: L-14 may be replaced with A, C, D, E, F, G, H, I, K, M, N, P, Q, R, 
S, T, V, W or Y; L-15 may be replaced with A, C, D, E, F, G, H, I, K, M, N, P, Q, R, S, T, 
V, W or Y; L-16 may be replaced with A, C, D, E, F, G, H, I, K, M, N, P, Q, R, S, T, V, W 
or Y; L-17 may be replaced with A, C, D, E, F, G, H, I, K, M, N, P, Q, R, S, T, V, W or Y; 
L-18 may be replaced with A, C, D, E, F, G, H, I, K, M, N, P, Q, R, S, T, V, W or Y; L-19 
may be replaced with A, C, D, E, F, G, H, I, K, M, N, P, Q, R, S, T, V, W or Y; L-20 may 
be replaced with A, C, D, E, F, G, H, I, K, M, N, P, Q, R, S, T, V, W or Y; L-21 may be 
replaced with A, C, D, E, F, G, H, I, K, M, N, P, Q, R, S, T, V, W or Y; L-22 may be 
replaced with A, C, D, E, F, G, H, I, K, M, N, P, Q, R, S, T, V, W or Y; P-23 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, Q, R, S, T, V, W or Y; L-24 may be 
replaced with A, C, D, E, F, G, H, I, K, M, N, P, Q, R, S, T, V, W or Y; A-25 may be 
replaced with C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; R-26 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, S, T, V, W or Y; G-27 may be 
replaced with A, C, D, E, F, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; A-28 may be 
replaced with C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; P-29 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, Q, R, S, T, V, W or Y; A-30 may be 
replaced with C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; R-31 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, S, T, V, W or Y; P-32 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, Q, R, S, T, V, W or Y; A-33 may be 
replaced with C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; A-34 may be 
replaced with C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; P-204 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, Q, R, S, T, V, W or Y; P-205 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, Q, R, S, T, V, W or Y; L-206 may be 
replaced with A, C, D, E, F, G, H, I, K, M, N, P, Q, R, S, T, V, W or Y; G-207 may be 
replaced with A, C, D, E, F, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; A-208 may be 
replaced with C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; T-209 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, V, W or Y; S-210 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, T, V, W or Y; R-211 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, S, T, V, W or Y; T-212 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, V, W or Y; K-213 may be 
replaced with A, C, D, E, F, G, H, I, L, M, N, P, Q, R, S, T, V, W or Y; R-214 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, S, T, V, W or Y; F-215 may be 
replaced with A, C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; V-216 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, W or Y; S-217 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, T, V, W or Y; E-218 may be 
replaced with A, C, D, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; A-219 may be 
replaced with C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; R-220 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, S, T, V, W or Y; F-221 may be 
replaced with A, C, D, E, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; V-222 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, W or Y; E-223 may be 
replaced with A, C, D, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; T-224 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, V, W or Y; P-430 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, Q, R, S, T, V, W or Y; G-431 may be 
replaced with A, C, D, E, F, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; A-432 may be 
replaced with C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; A-433 may be 
replaced with C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; L-434 may be 
replaced with A, C, D, E, F, G, H, I, K, M, N, P, Q, R, S, T, V, W or Y; P-435 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, Q, R, S, T, V, W or Y; L-436 may be 
replaced with A, C, D, E, F, G, H, I, K, M, N, P, Q, R, S, T, V, W or Y; P-437 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, Q, R, S, T, V, W or Y; T-438 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, V, W or Y; G-439 may be 
replaced with A, C, D, E, F, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; L-440 may be 
replaced with A, C, D, E, F, G, H, I, K, M, N, P, Q, R, S, T, V, W or Y; P-441 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, Q, R, S, T, V, W or Y; G-442 may be 
replaced with A, C, D, E, F, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; R-443 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, S, T, V, W or Y; M-444 may be 
replaced with A, C, D, E, F, G, H, I, K, L, N, P, Q, R, S, T, V, W or Y; A-445 may be 
replaced with C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; L-446 may be 
replaced with A, C, D, E, F, G, H, I, K, M, N, P, Q, R, S, T, V, W or Y; Y-447 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V or W; Q-448 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, R, S, T, V, W or Y; L-449 may be 
replaced with A, C, D, E, F, G, H, I, K, M, N, P, Q, R, S, T, V, W or Y; D-450 may be 
replaced with A, C, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; E-520 may be 
replaced with A, C, D, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; R-521 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, S, T, V, W or Y; P-522 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, Q, R, S, T, V, W or Y; K-523 may be 
replaced with A, C, D, E, F, G, H, I, L, M, N, P, Q, R, S, T, V, W or Y; P-524 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, Q, R, S, T, V, W or Y; V-525 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, W or Y; V-526 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, W or Y; D-527 may be 
replaced with A, C, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; G-528 may be 
replaced with A, C, D, E, F, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; G-529 may be 
replaced with A, C, D, E, F, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; W-530 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, or Y; A-531 may be 
replaced with C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; P-532 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, Q, R, S, T, V, W or Y; W-533 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, or Y; G-534 may be 
replaced with A, C, D, E, F, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; P-535 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, Q, R, S, T, V, W or Y; W-536 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V or Y; G-537 may be 
replaced with A, C, D, E, F, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; E-538 may be 
replaced with A, C, D, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; C-539 may be 
replaced with A, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; S-540 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, T, V, W or Y; N-827 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, P, Q, R, S, T, V, W or Y; I-828 may be 
replaced with A, C, D, E, F, G, H, K, L, M, N, P, Q, R, S, T, V, W or Y; I-829 may be 
replaced with A, C, D, E, F, G, H, K, L, M, N, P, Q, R, S, T, V, W or Y; Q-830 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, R, S, T, V, W or Y; P-831 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, Q, R, S, T, V, W or Y; L-832 may be 
replaced with A, C, D, E, F, G, H, I, K, M, N, P, Q, R, S, T, V, W or Y; L-833 may be 
replaced with A, C, D, E, F, G, H, I, K, M, N, P, Q, R, S, T, V, W or Y; H-834 may be 
replaced with A, C, D, E, F, G, I, K, L, M, N, P, Q, R, S, T, V, W or Y; A-835 may be 
replaced with C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; Q-836 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, R, S, T, V, W or Y; W-837 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, or Y; V-838 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, W or Y; L-839 may be 
replaced with A, C, D, E, F, G, H, I, K, M, N, P, Q, R, S, T, V, W or Y; G-840 may be 
replaced with A, C, D, E, F, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; D-841 may be 
replaced with A, C, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; W-842 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V or Y; S-843 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, T, V, W or Y; E-844 may be 
replaced with A, C, D, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; C-845 may be 
replaced with A, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W or Y; S-846 may be 
replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, T, V, W or Y; and/or S-847 may 
be replaced with A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, T, V, W or Y. 

METH1 or METH2 polypeptide variants, including substitution, deletion and/or 
addition variants, which contain amino acid substitutions can be tested for activity in any of 
the assays described herein, for example, the chorioallantoic assay or the cornea pocket 
assay. Preferred are METH1 or METH2 polypeptides with conservative substitutions that: 
maintain all the activities and/or properties of the wild type protein; or have one or more 
enhanced activities and/or properties compared to the wild type protein. Also preferred are 
METH1 or METH2 polypeptides with nonconservative substitutions which: lack an activity 
and/or property of the wild type protein, while maintaining all other activities and/or 
properties; or lack more than one activity and/or property of the wild type protein. 

For example, activities or properties of METH1 or METH2 that may be altered in 
METH1 or METH2 polypeptides with conservative or nonconservative substitutions 
include, but are not limited to: stimulation of angiogenesis; stimulation of epithelial cell 
proliferation; antibody binding; ligand binding; stability; solubility; and/or properties which 
affect purification. 

The polypeptides of the present invention are preferably provided in an isolated form. 
By "isolated polypeptide" is intended a polypeptide removed from its native environment. 
Thus, a polypeptide produced and/or contained within a recombinant host cell is considered 
isolated for purposes of the present invention. Also intended as an "isolated polypeptide" 
are polypeptides that have been purified, partially or substantially, from a recombinant host 
cell or from a native source. For example, a recombinantly produced version of the METH1 
or METH2 polypeptide can be substantially purified by the one-step method described in 
Smith and Johnson, Gene 67:31-40 (1988). 

The polypeptides of the present invention include the METH1 polypeptide encoded 
by the deposited cDNA including the leader; the mature METH1 polypeptide encoded by 
the deposited cDNA minus the leader (i.e., the mature protein); a polypeptide comprising 
amino acids about 1 to about 950 in SEQ ID NO:2; a polypeptide comprising amino acids 
about 2 to about 950 in SEQ ID NO:2; a polypeptide comprising amino acids about 29 to 
about 950 in SEQ ID NO:2; a polypeptide comprising amino acids about 30 to about 950 
in SEQ ID NO:2; a polypeptide comprising the metalloprotease domain of METH1, amino 
acids 235 to 459 in SEQ ID NO:2; a polypeptide comprising the disintegrin domain of 
METH1, amino acids 460 to 544 in SEQ ID NO:2; a polypeptide comprising the first TSP-like 
domain of METH1, amino acids 545 to 598 in SEQ ID NO:2; a polypeptide comprising 
the second TSP-like domain of METH1, amino acids 841 to 894 in SEQ ID NO:2; a 
polypeptide comprising the third TSP-like domain of METH1, amino acids 895 to 934 in 
SEQ ID NO:2; a polypeptide comprising amino acids 536 to 613 in SEQ ID NO:2; a 
polypeptide comprising amino acids 549 to 563 in SEQ ID NO:2; the METH2 polypeptide 
encoded by the deposited cDNA including the leader; the mature METH2 polypeptide 
encoded by the deposited cDNA minus the leader (i.e., the mature protein); a polypeptide 
comprising amino acids about 1 to about 890 in SEQ ID NO:4; a polypeptide comprising 
amino acids about 2 to about 890 in SEQ ID NO:4; a polypeptide comprising amino acids 
about 24 to about 890 in SEQ ID NO:4; a polypeptide comprising amino acids about 112 
to about 890 in SEQ ID NO:4; a polypeptide comprising the metalloprotease domain of 
METH2, amino acids 214 to 439 in SEQ ID NO:4; a polypeptide comprising the disintegrin 
domain of METH2, amino acids 440 to 529 in SEQ ID NO:4; a polypeptide comprising the 
first TSP-like domain of METH2, amino acids 530 to 583 in SEQ ID NO:4; a polypeptide 
comprising the second TSP-like domain of METH2, amino acids 837 to 890 in SEQ ID 
NO:4; a polypeptide comprising amino acids 280 to 606 in SEQ ID NO:4; a polypeptide 
comprising amino acids 529 to 548 in SEQ ID NO:4; as well as polypeptides which are at 
least 95% identical, and more preferably at least 96%, 97%, 98% or 99% identical to the 
polypeptides described above and also include portions of such polypeptides with at least 
30 amino acids and more preferably at least 50 amino acids. 

By a polypeptide having an amino acid sequence at least, for example, 95% 
"identical" to a reference amino acid sequence of a METH1 or METH2 polypeptide is 
intended that the amino acid sequence of the polypeptide is identical to the reference 
sequence except that the polypeptide sequence may include up to five amino acid alterations 
per each 100 amino acids of the reference amino acid sequence of the METH1 or METH2 
polypeptide. In other words, to obtain a polypeptide having an amino acid sequence at least 
95% identical to a reference amino acid sequence, up to 5% of the amino acid residues in 
the reference sequence may be deleted or substituted with another amino acid, or a number 
of amino acids up to 5% of the total amino acid residues in the reference sequence may be 
inserted into the reference sequence. These alterations of the reference sequence may occur 
at the amino or carboxy terminal positions of the reference amino acid sequence or anywhere 
between those terminal positions, interspersed either individually among residues in the 
reference sequence or in one or more contiguous groups within the reference sequence. 
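The arithmetic behind this definition can be sketched in Python. This is a minimal illustration, not a procedure taken from the specification: the function name `max_alterations` is hypothetical, and rounding a fractional allowance down to a whole residue is an assumption. The lengths 950 and 890 correspond to the full-length residue ranges recited above for SEQ ID NO:2 and SEQ ID NO:4.

```python
def max_alterations(reference_length, percent_identity):
    """Largest whole number of altered residues that still meets the threshold.

    percent_identity is an integer such as 95, 96, 97, 98 or 99.
    """
    if reference_length <= 0:
        raise ValueError("reference_length must be positive")
    if not 0 <= percent_identity <= 100:
        raise ValueError("percent_identity must be between 0 and 100")
    # Integer arithmetic avoids floating-point error; any fractional
    # allowance is rounded down (an assumption, not stated in the text).
    return reference_length * (100 - percent_identity) // 100


if __name__ == "__main__":
    for length in (100, 890, 950):
        print(length, max_alterations(length, 95))
```

For a 100-residue reference at 95% identity the function returns 5, matching the "up to five amino acid alterations per each 100 amino acids" wording.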

As a practical matter, whether any particular polypeptide is at least 95%, 96%, 97%, 
98% or 99% identical to, for instance, the amino acid sequence shown in SEQ ID NO:2 or 
SEQ ID NO:4 or to the amino acid sequence encoded by deposited cDNA clones can be 
determined conventionally using known computer programs such as the Bestfit program 
(Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group, 
University Research Park, 575 Science Drive, Madison, WI 53711). When using Bestfit or 
any other sequence alignment program to determine whether a particular sequence is, for 
instance, 95% identical to a reference sequence according to the present invention, the 
parameters are set, of course, such that the percentage of identity is calculated over the full 
length of the reference amino acid sequence and that gaps in homology of up to 5% of the 
total number of amino acid residues in the reference sequence are allowed. 

A preferred method for determining the best overall match between a query sequence 
(a sequence of the present invention) and a subject sequence, also referred to as a global 
sequence alignment, uses the FASTDB computer program based on the 
algorithm of Brutlag et al., Comp. App. Biosci. 6:237-245 (1990). In a sequence alignment, 
the query and subject sequences are either both nucleotide sequences or both amino acid 
sequences. The result of said global sequence alignment is expressed as percent identity. Preferred 
parameters used in a FASTDB amino acid alignment are: Matrix=PAM 0, k-tuple=2, 
Mismatch Penalty=1, Joining Penalty=20, Randomization Group Length=0, Cutoff Score=1, 
Gap Penalty=5, Gap Size Penalty=0.05, Window Size=500 or the length of the subject 
amino acid sequence, whichever is shorter. 

If the subject sequence is shorter than the query sequence due to N- or C-terminal 
deletions, not because of internal deletions, a manual correction must be made to the results. 
This is because the FASTDB program does not account for N- and C-terminal truncations 
of the subject sequence when calculating global percent identity. For subject sequences 
truncated at the N- and C-termini, relative to the query sequence, the percent identity is 
corrected by calculating the number of residues of the query sequence that are N- and C- 
terminal of the subject sequence, which are not matched/aligned with a corresponding 
subject residue, as a percent of the total residues of the query sequence. Whether a residue 
is matched/aligned is determined by the results of the FASTDB sequence alignment. This 
percentage is then subtracted from the percent identity, calculated by the above FASTDB 
program using the specified parameters, to arrive at a final percent identity score. This final 
percent identity score is what is used for the purposes of the present invention. Only 
residues to the N- and C-termini of the subject sequence, which are not matched/aligned 
with the query sequence, are considered for the purposes of manually adjusting the percent 
identity score. That is, only query residue positions outside the farthest N- and C-terminal 
residues of the subject sequence are considered. 

For example, a 90 amino acid residue subject sequence is aligned with a 100 residue 
query sequence to determine percent identity. The deletion occurs at the N-terminus of the 
subject sequence and therefore, the FASTDB alignment does not show a match/alignment 
of the first 10 residues at the N-terminus. The 10 unpaired residues represent 10% of the 
sequence (number of residues at the N- and C-termini not matched/total number of residues 
in the query sequence) so 10% is subtracted from the percent identity score calculated by the 
FASTDB program. If the remaining 90 residues were perfectly matched, the final percent 
identity would be 90%. In another example, a 90 residue subject sequence is compared with 
a 100 residue query sequence. This time, the deletions are internal, so there are no residues 
at the N- or C-termini of the subject sequence which are not matched/aligned with the query. 
In this case, the percent identity calculated by FASTDB is not manually corrected. Once 
again, only residue positions outside the N- and C-terminal ends of the subject sequence, as 
displayed in the FASTDB alignment, which are not matched/aligned with the query 
sequence are manually corrected for. No other manual corrections are made for the purposes 
of the present invention. 
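
The manual correction described above amounts to a single subtraction. The following is a minimal sketch only; the function and argument names are hypothetical, and the FASTDB percent identity and the counts of unmatched terminal query residues are assumed to have been read from the FASTDB alignment beforehand.

```python
def corrected_percent_identity(fastdb_identity, query_length,
                               n_term_unmatched=0, c_term_unmatched=0):
    """Subtract the terminal-overhang percentage from the FASTDB score.

    fastdb_identity  -- percent identity reported by FASTDB (0-100)
    query_length     -- total number of residues in the query sequence
    n_term_unmatched -- query residues N-terminal of the subject, unmatched
    c_term_unmatched -- query residues C-terminal of the subject, unmatched
    Internal gaps are NOT counted here; only terminal overhangs are.
    """
    if query_length <= 0:
        raise ValueError("query_length must be positive")
    overhang = n_term_unmatched + c_term_unmatched
    if overhang < 0 or overhang > query_length:
        raise ValueError("unmatched terminal residues out of range")
    penalty = 100.0 * overhang / query_length
    return fastdb_identity - penalty


# First example from the text: 100-residue query, 90-residue subject,
# 10 N-terminal query residues unmatched, remaining 90 perfectly matched.
truncated = corrected_percent_identity(100.0, 100, n_term_unmatched=10)

# Second example: deletions are internal, so no correction is applied.
# The 90.0 passed in is an illustrative placeholder, not a value from the text.
internal = corrected_percent_identity(90.0, 100)

print(truncated, internal)
```

With terminal overhang of zero the function returns the FASTDB value unchanged, which mirrors the statement that no other manual corrections are made.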

The polypeptides of the present invention are useful as a molecular weight marker 
on SDS-PAGE gels or on molecular sieve gel filtration columns using methods well known 
to those of skill in the art. 

In another aspect, the invention provides a peptide or polypeptide comprising an 
epitope-bearing portion of a polypeptide of the invention. The epitope of this polypeptide 
portion is an immunogenic or antigenic epitope of a polypeptide described herein. An 
"immunogenic epitope" is defined as a part of a protein that elicits an antibody response 
when the whole protein is the immunogen. On the other hand, a region of a protein 
molecule to which an antibody can bind is defined as an "antigenic epitope." The number 
of immunogenic epitopes of a protein generally is less than the number of antigenic epitopes. 
See, for instance, Geysen et al., Proc. Natl. Acad. Sci. USA 81:3998-4002 (1983). 

As to the selection of peptides or polypeptides bearing an antigenic epitope (i.e., that 
contain a region of a protein molecule to which an antibody can bind), it is well known in 
the art that relatively short synthetic peptides that mimic part of a protein sequence are 
routinely capable of eliciting an antiserum that reacts with the partially mimicked protein. 
See, for instance, Sutcliffe, J. G. et al., "Antibodies that react with predetermined sites on 
proteins", Science 219:660-666 (1983). Peptides capable of eliciting protein-reactive sera 
are frequently represented in the primary sequence of a protein, can be characterized by a 
set of simple chemical rules, and are confined neither to immunodominant regions of intact 
proteins (i.e., immunogenic epitopes) nor to the amino or carboxyl terminals. 

Antigenic epitope-bearing peptides and polypeptides of the invention are therefore 
useful to raise antibodies, including monoclonal antibodies, that bind specifically to a 
polypeptide of the invention. See, for instance, Wilson et al., Cell 37:767-778 (1984) at 
777. 

Antigenic epitope-bearing peptides and polypeptides of the invention preferably 
contain a sequence of at least seven, more preferably at least nine and most preferably 
between about 15 to about 30 amino acids contained within the amino acid 
sequence of a polypeptide of the invention. 

The epitope-bearing peptides and polypeptides of the invention may be produced by 
any conventional means. See, for instance, Houghten, R. A., "General method for the rapid 
solid-phase synthesis of large numbers of peptides: specificity of antigen-antibody interaction 
at the level of individual amino acids", Proc. Natl. Acad. Sci. USA 82:5131-5135 (1985). This 
"Simultaneous Multiple Peptide Synthesis (SMPS)" process is further described in U.S. 
Patent No. 4,631,211 to Houghten et al. (1986). 

As one of skill in the art will appreciate, METH1 or METH2 polypeptides of the 
present invention and the epitope-bearing fragments thereof described above can be 
combined with parts of the constant domain of immunoglobulins (IgG), resulting in chimeric 
polypeptides. These fusion proteins facilitate purification and show an increased half-life 
in vivo. This has been shown, e.g., for chimeric proteins consisting of the first two domains 
of the human CD4-polypeptide and various domains of the constant regions of the heavy or 
light chains of mammalian immunoglobulins (EP A 394,827; Traunecker et al., Nature 
331:84-86 (1988)). Fusion proteins that have a disulfide-linked dimeric structure due to the 
IgG part can also be more efficient in binding and neutralizing other molecules than the 
monomeric METH1 or METH2 protein or protein fragment alone (Fountoulakis et al., J. 
Biochem. 270:3958-3964 (1995)). 

METH1 and METH2 Polynucleotide and Polypeptide Fragments 

In the present invention, a "polynucleotide fragment" refers to a short polynucleotide 
having a nucleic acid sequence contained in the deposited clones or shown in SEQ ID NO:1 
or SEQ ID NO:3. The short nucleotide fragments are preferably at least about 15 nt, and 
more preferably at least about 20 nt, still more preferably at least about 30 nt, and even more 
preferably, at least about 40 nt in length. A fragment "at least 20 nt in length," for example, 
is intended to include 20 or more contiguous bases from the cDNA sequence contained in 
the deposited clones or the nucleotide sequence shown in SEQ ID NO:1 or SEQ ID NO:3. 
These nucleotide fragments are useful as diagnostic probes and primers as discussed herein. 
Of course, larger fragments (e.g., 50, 150, 500, 600, 2000 nucleotides) are preferred. 
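
By way of illustration only, the "at least 20 nt in length" criterion can be expressed as a contiguity-plus-length check. The function name and the 40-nt reference string below are hypothetical; the string is an arbitrary stand-in and is not taken from SEQ ID NO:1 or SEQ ID NO:3.

```python
def is_qualifying_fragment(candidate, reference, min_length=20):
    """Return True if `candidate` is a contiguous stretch of `reference`
    that is at least `min_length` bases long (case-insensitive)."""
    candidate = candidate.upper()
    reference = reference.upper()
    return len(candidate) >= min_length and candidate in reference


# Hypothetical 40-nt reference used purely for demonstration.
toy_reference = "ATGGGCAACGCTGAGCGTGCTCCGGGTTCTCGTTCTTTCG"

twenty_mer = toy_reference[5:25]    # 20 contiguous bases
nineteen_mer = toy_reference[5:24]  # one base too short

print(is_qualifying_fragment(twenty_mer, toy_reference))
print(is_qualifying_fragment(nineteen_mer, toy_reference))
```

Changing `min_length` to 15, 30 or 40 gives the other preferred thresholds named in the paragraph above.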

Moreover, representative examples of METH1 or METH2 polynucleotide fragments 
include, for example, fragments having a sequence from about nucleotide number 1-50, 51- 
100, 101-150, 151-200, 201-250, 251-300, 301-350, 351-400, 401-450, 451-500, 501-550, 
551-600, 651-700, 701-750, 751-800, 800-850, 851-900, 901-950, 951-1000, 1001-1050, 
1051-1100, 1101-1150, 1151-1200, 1201-1250, 1251-1300, 1301-1350, 1351-1400, 1401- 
1450, 1451-1500, 1501-1550, 1551-1600, 1601-1650, 1651-1700, 1701-1750, 1751-1800, 
1801-1850, 1851-1900, 1901-1950, 1951-2000, or 2001 to the end of SEQ ID NO:1 or SEQ 
ID NO:3 or the cDNA contained in the deposited clones. In this context "about" includes 
the particularly recited ranges, larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at 
either terminus or at both termini. Preferably, these fragments encode a polypeptide which 
has biological activity. More preferably, these polynucleotides can be used as probes or 
primers as discussed herein. 

In the present invention, a "polypeptide fragment" refers to a short amino acid 
sequence contained in SEQ ID NO:2 or SEQ ID NO:4 or encoded by the cDNA contained 
in the deposited clones. Protein fragments may be "free-standing," or comprised within a 
larger polypeptide of which the fragment forms a part or region, most preferably as a single 
continuous region. Representative examples of polypeptide fragments of the invention 
include, for example, fragments from about amino acid number 1-20, 21-40, 41-60, 61-80, 
81-100, 102-120, 121-140, 141-160, 161-180, 181-200, 201-220, 221-240, 241-260, 261- 
280, or 281 to the end of the coding region of SEQ ID NO:2 or SEQ ID NO:4. Moreover, 
polypeptide fragments can be about 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 
or 150 amino acids in length. In this context "about" includes the particularly recited ranges, 
larger or smaller by several (5, 4, 3, 2, or 1) amino acids, at either extreme or at both 
extremes. 
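
The recited windows and the meaning of "about" can be sketched as follows. This is an illustrative simplification: the helper names are hypothetical, positions are 1-based and inclusive, and the windows are tiled evenly, which is an assumption rather than a reproduction of the exact ranges recited above.

```python
def fragment_windows(total_length, width):
    """Tile 1..total_length into consecutive 1-based inclusive windows of
    `width` positions; the final window runs to the end of the sequence."""
    return [(start, min(start + width - 1, total_length))
            for start in range(1, total_length + 1, width)]


def about_variants(start, end, total_length, tolerance=5):
    """Enumerate every (start, end) pair in which either terminus, or both,
    is shifted by up to `tolerance` positions, clipped to the sequence."""
    variants = []
    for s in range(max(1, start - tolerance), start + tolerance + 1):
        for e in range(end - tolerance, min(total_length, end + tolerance) + 1):
            if s <= e:
                variants.append((s, e))
    return variants


# 50-nt windows over a hypothetical 120-nt sequence.
print(fragment_windows(120, 50))

# Number of ranges covered by "about 51-100" in a hypothetical 2000-nt sequence.
print(len(about_variants(51, 100, 2000)))
```

The same two helpers apply to polypeptide fragments by passing a width of 20 and the length of the amino acid sequence.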

Preferred polypeptide fragments include the secreted METH1 or METH2 protein as 
well as the mature form. Further preferred polypeptide fragments include the secreted 
METH1 or METH2 protein or the mature form having a continuous series of deleted 
residues from the amino or the carboxy terminus, or both. For example, any number of 
amino acids, ranging from 1-60, can be deleted from the amino terminus of either the 
secreted METH1 or METH2 polypeptide or the mature form. Similarly, any number of 
amino acids, ranging from 1-30, can be deleted from the carboxy terminus of the secreted 
METH1 or METH2 protein or mature form. Furthermore, any combination of the above 
amino and carboxy terminus deletions is preferred. Similarly, polynucleotide fragments 
encoding these METH1 or METH2 polypeptide fragments are also preferred. 
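
As a rough sketch of the combined terminal deletions just described, the following helper removes a chosen number of residues from each end of a sequence. The names are hypothetical, and the ten-letter string is a placeholder rather than any METH1 or METH2 sequence.

```python
def delete_termini(sequence, n_deleted=0, c_deleted=0):
    """Return `sequence` with `n_deleted` residues removed from the amino
    terminus and `c_deleted` residues removed from the carboxy terminus."""
    if n_deleted < 0 or c_deleted < 0:
        raise ValueError("deletion counts must be non-negative")
    if n_deleted + c_deleted >= len(sequence):
        raise ValueError("deletions would remove the entire sequence")
    return sequence[n_deleted:len(sequence) - c_deleted]


def count_combined_deletions(max_n=60, max_c=30):
    """Number of distinct (amino, carboxy) deletion pairs when 1..max_n
    residues are removed from one end and 1..max_c from the other."""
    return max_n * max_c


# Placeholder sequence; two residues removed N-terminally, three C-terminally.
print(delete_termini("ABCDEFGHIJ", n_deleted=2, c_deleted=3))
print(count_combined_deletions())
```

The defaults of 60 and 30 correspond to the ranges 1-60 and 1-30 recited in the paragraph above.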
Particularly, N-terminal deletions of the METH1 polypeptide can be described by 
the general formula m-950, where m is an integer from 2 to 949, where m corresponds to the 
position of the amino acid residue identified in SEQ ID NO:2. Preferably, N-terminal 
deletions of the METH1 polypeptide of the invention shown as SEQ ID NO:2 include 
polypeptides comprising the amino acid sequence of residues: G-2 to S-950; N-3 to S-950; 

A-4 to S-950; E-5 to S-950; R-6 to S-950; A-7 to S-950; P-8 to S-950; G-9 to S-950; 
S-10 to S-950; R-11 to S-950; S-12 to S-950; F-13 to S-950; G-14 to S-950; P-15 to S-950; 
V-16 to S-950; P-17 to S-950; T-18 to S-950; L-19 to S-950; L-20 to S-950; L-21 to S-950; 
L-22 to S-950; A-23 to S-950; A-24 to S-950; A-25 to S-950; L-26 to S-950; L-27 to S-950; 
A-28 to S-950; V-29 to S-950; S-30 to S-950; D-31 to S-950; A-32 to S-950; L-33 to S-950; 
G-34 to S-950; R-35 to S-950; P-36 to S-950; S-37 to S-950; E-38 to S-950; E-39 to S-950; 
D-40 to S-950; E-41 to S-950; E-42 to S-950; L-43 to S-950; V-44 to S-950; V-45 to S-950; 
P-46 to S-950; E-47 to S-950; L-48 to S-950; E-49 to S-950; R-50 to S-950; A-51 to S-950; 
P-52 to S-950; G-53 to S-950; H-54 to S-950; G-55 to S-950; T-56 to S-950; T-57 to S-950; 
R-58 to S-950; L-59 to S-950; R-60 to S-950; L-61 to S-950; H-62 to S-950; A-63 to S-950; 
F-64 to S-950; D-65 to S-950; Q-66 to S-950; Q-67 to S-950; L-68 to S-950; D-69 to S-950; 
L-70 to S-950; E-71 to S-950; L-72 to S-950; R-73 to S-950; P-74 to S-950; D-75 to S-950; 
S-76 to S-950; S-77 to S-950; F-78 to S-950; L-79 to S-950; A-80 to S-950; P-81 to S-950; 
G-82 to S-950; F-83 to S-950; T-84 to S-950; L-85 to S-950; Q-86 to S-950; N-87 to S-950; 
V-88 to S-950; G-89 to S-950; R-90 to S-950; K-91 to S-950; S-92 to S-950; G-93 to S-950; 
S-94 to S-950; E-95 to S-950; T-96 to S-950; P-97 to S-950; L-98 to S-950; P-99 to S-950; 
E-100 to S-950; T-101 to S-950; D-102 to S-950; L-103 to S-950; A-104 to S-950; H-105 to S-950; 
C-106 to S-950; F-107 to S-950; Y-108 to S-950; S-109 to S-950; G-110 to S-950; T-111 to S-950; 
V-112 to S-950; N-113 to S-950; G-114 to S-950; D-115 to S-950; P-116 to S-950; S-117 to S-950; 
S-118 to S-950; A-119 to S-950; A-120 to S-950; A-121 to S-950; L-122 to S-950; S-123 to S-950; 
L-124 to S-950; C-125 to S-950; E-126 to S-950; G-127 to S-950; V-128 to S-950; R-129 to S-950; 
G-130 to S-950; A-131 to S-950; F-132 to S-950; Y-133 to S-950; L-134 to S-950; L-135 to S-950; 
G-136 to S-950; E-137 to S-950; A-138 to S-950; Y-139 to S-950; F-140 to S-950; I-141 to S-950; 
Q-142 to S-950; P-143 to S-950; L-144 to S-950; P-145 to S-950; A-146 to S-950; A-147 to S-950; 
S-148 to S-950; E-149 to S-950; R-150 to S-950; L-151 to S-950; A-152 to S-950; T-153 to S-950; 
A-154 to S-950; A-155 to S-950; P-156 to S-950; G-157 to S-950; E-158 to S-950; K-159 to S-950; 
P-160 to S-950; P-161 to S-950; A-162 to S-950; P-163 to S-950; L-164 to S-950; Q-165 to S-950; 
F-166 to S-950; H-167 to S-950; L-168 to S-950; L-169 to S-950; R-170 to S-950; R-171 to S-950; 
N-172 to S-950; R-173 to S-950; Q-174 to S-950; G-175 to S-950; D-176 to S-950; V-177 to S-950; 
G-178 to S-950; G-179 to S-950; T-180 to S-950; C-181 to S-950; G-182 to S-950; V-183 to S-950; 
V-184 to S-950; D-185 to S-950; D-186 to S-950; E-187 to S-950; P-188 to S-950; R-189 to S-950; 
P-190 to S-950; T-191 to S-950; G-192 to S-950; K-193 to S-950; A-194 to S-950; E-195 to S-950; 
T-196 to S-950; E-197 to S-950; 
D-198 to S-950; E-199 to S-950; D-200 to S-950; E-201 to S-950; G-202 to S-950; T-203 to S-950; 
E-204 to S-950; G-205 to S-950; E-206 to S-950; D-207 to S-950; E-208 to S-950; G-209 to S-950; 
P-210 to S-950; Q-211 to S-950; W-212 to S-950; S-213 to S-950; P-214 to S-950; Q-215 to S-950; 
D-216 to S-950; P-217 to S-950; A-218 to S-950; L-219 to S-950; Q-220 to S-950; G-221 to S-950; 
V-222 to S-950; G-223 to S-950; Q-224 to S-950; P-225 to S-950; T-226 to S-950; G-227 to S-950; 
T-228 to S-950; G-229 to S-950; S-230 to S-950; I-231 to S-950; R-232 to S-950; K-233 to S-950; 
K-234 to S-950; R-235 to S-950; F-236 to S-950; V-237 to S-950; S-238 to S-950; S-239 to S-950; 
H-240 to S-950; R-241 to S-950; Y-242 to S-950; V-243 to S-950; E-244 to S-950; T-245 to S-950; 
M-246 to S-950; L-247 to S-950; V-248 to S-950; A-249 to S-950; D-250 to S-950; Q-251 to S-950; 
S-252 to S-950; M-253 to S-950; A-254 to S-950; E-255 to S-950; F-256 to S-950; H-257 to S-950; 
G-258 to S-950; S-259 to S-950; G-260 to S-950; L-261 to S-950; K-262 to S-950; H-263 to S-950; 
Y-264 to S-950; L-265 to S-950; L-266 to S-950; T-267 to S-950; L-268 to S-950; F-269 to S-950; 
S-270 to S-950; V-271 to S-950; A-272 to S-950; A-273 to S-950; R-274 to S-950; L-275 to S-950; 
Y-276 to S-950; K-277 to S-950; H-278 to S-950; P-279 to S-950; S-280 to S-950; I-281 to S-950; 
R-282 to S-950; N-283 to S-950; S-284 to S-950; V-285 to S-950; S-286 to S-950; L-287 to S-950; 
V-288 to S-950; V-289 to S-950; V-290 to S-950; K-291 to S-950; I-292 to S-950; L-293 to S-950; 
V-294 to S-950; I-295 to S-950; H-296 to S-950; D-297 to S-950; E-298 to S-950; Q-299 to S-950; 
K-300 to S-950; G-301 to S-950; P-302 to S-950; E-303 to S-950; V-304 to S-950; T-305 to S-950; 
S-306 to S-950; N-307 to S-950; A-308 to S-950; A-309 to S-950; L-310 to S-950; T-311 to S-950; 
L-312 to S-950; R-313 to S-950; N-314 to S-950; F-315 to S-950; C-316 to S-950; N-317 to S-950; 
W-318 to S-950; Q-319 to S-950; K-320 to S-950; Q-321 to S-950; H-322 to S-950; N-323 to S-950; 
P-324 to S-950; P-325 to S-950; S-326 to S-950; D-327 to S-950; R-328 to S-950; D-329 to S-950; 
A-330 to S-950; E-331 to S-950; H-332 to S-950; Y-333 to S-950; D-334 to S-950; T-335 to S-950; 
A-336 to S-950; I-337 to S-950; L-338 to S-950; F-339 to S-950; T-340 to S-950; R-341 to S-950; 
Q-342 to S-950; D-343 to S-950; L-344 to S-950; C-345 to S-950; G-346 to S-950; S-347 to S-950; 
Q-348 to S-950; T-349 to S-950; C-350 to S-950; 
D-351 to S-950; T-352 to S-950; L-353 to S-950; G-354 to S-950; M-355 to S-950; A-356 to S-950; 
D-357 to S-950; V-358 to S-950; G-359 to S-950; T-360 to S-950; V-361 to S-950; C-362 to S-950; 
D-363 to S-950; P-364 to S-950; S-365 to S-950; R-366 to S-950; S-367 to S-950; C-368 to S-950; 
S-369 to S-950; V-370 to S-950; I-371 to S-950; E-372 to S-950; D-373 to S-950; D-374 to S-950; 
G-375 to S-950; L-376 to S-950; Q-377 to S-950; A-378 to S-950; A-379 to S-950; F-380 to S-950; 
T-381 to S-950; T-382 to S-950; A-383 to S-950; H-384 to S-950; E-385 to S-950; L-386 to S-950; 
G-387 to S-950; H-388 to S-950; V-389 to S-950; F-390 to S-950; N-391 to S-950; M-392 to S-950; 
P-393 to S-950; H-394 to S-950; D-395 to S-950; D-396 to S-950; A-397 to S-950; K-398 to S-950; 
Q-399 to S-950; C-400 to S-950; A-401 to S-950; S-402 to S-950; L-403 to S-950; N-404 to S-950; 
G-405 to S-950; V-406 to S-950; N-407 to S-950; Q-408 to S-950; D-409 to S-950; S-410 to S-950; 
H-411 to S-950; M-412 to S-950; M-413 to S-950; A-414 to S-950; S-415 to S-950; M-416 to S-950; 
L-417 to S-950; S-418 to S-950; N-419 to S-950; L-420 to S-950; D-421 to S-950; H-422 to S-950; 
S-423 to S-950; Q-424 to S-950; P-425 to S-950; W-426 to S-950; S-427 to S-950; P-428 to S-950; 
C-429 to S-950; S-430 to S-950; A-431 to S-950; Y-432 to S-950; M-433 to S-950; I-434 to S-950; 
T-435 to S-950; S-436 to S-950; F-437 to S-950; L-438 to S-950; D-439 to S-950; N-440 to S-950; 
G-441 to S-950; H-442 to S-950; G-443 to S-950; E-444 to S-950; C-445 to S-950; L-446 to S-950; 
M-447 to S-950; D-448 to S-950; K-449 to S-950; P-450 to S-950; Q-451 to S-950; N-452 to S-950; 
P-453 to S-950; I-454 to S-950; Q-455 to S-950; L-456 to S-950; P-457 to S-950; G-458 to S-950; 
D-459 to S-950; L-460 to S-950; P-461 to S-950; G-462 to S-950; T-463 to S-950; S-464 to S-950; 
Y-465 to S-950; D-466 to S-950; A-467 to S-950; N-468 to S-950; R-469 to S-950; Q-470 to S-950; 
C-471 to S-950; Q-472 to S-950; F-473 to S-950; T-474 to S-950; F-475 to S-950; G-476 to S-950; 
E-477 to S-950; D-478 to S-950; S-479 to S-950; K-480 to S-950; H-481 to S-950; C-482 to S-950; 
P-483 to S-950; D-484 to S-950; A-485 to S-950; A-486 to S-950; S-487 to S-950; T-488 to S-950; 
C-489 to S-950; S-490 to S-950; T-491 to S-950; L-492 to S-950; W-493 to S-950; C-494 to S-950; 
T-495 to S-950; G-496 to S-950; T-497 to S-950; S-498 to S-950; G-499 to S-950; G-500 to S-950; 
V-501 to S-950; L-502 to S-950; V-503 to S-950; C-504 to S-950; Q-505 to S-950; T-506 to S-950; 
K-507 to S-950; H-508 to S-950; F-509 to S-950; P-510 to S-950; W-511 to S-950; A-512 to S-950; 
D-513 to S-950; G-514 to S-950; T-515 to S-950; S-516 to S-950; C-517 to S-950; G-518 to S-950; 
E-519 to S-950; G-520 to S-950; K-521 to S-950; W-522 to S-950; C-523 to S-950; I-524 to S-950; 
N-525 to S-950; G-526 to S-950; K-527 to S-950; C-528 to S-950; V-529 to S-950; N-530 to S-950; 
K-531 to S-950; T-532 to S-950; D-533 to S-950; R-534 to S-950; K-535 to S-950; H-536 to S-950; 
F-537 to S-950; D-538 to S-950; T-539 to S-950; 
P-540 to S-950; F-541 to S-950; H-542 to S-950; G-543 to S-950; S-544 to S-950; W-545 to S-950; 
G-546 to S-950; M-547 to S-950; W-548 to S-950; G-549 to S-950; P-550 to S-950; W-551 to S-950; 
G-552 to S-950; D-553 to S-950; C-554 to S-950; S-555 to S-950; R-556 to S-950; T-557 to S-950; 
C-558 to S-950; G-559 to S-950; G-560 to S-950; G-561 to S-950; V-562 to S-950; Q-563 to S-950; 
Y-564 to S-950; T-565 to S-950; M-566 to S-950; R-567 to S-950; E-568 to S-950; C-569 to S-950; 
D-570 to S-950; N-571 to S-950; P-572 to S-950; V-573 to S-950; P-574 to S-950; K-575 to S-950; 
N-576 to S-950; G-577 to S-950; G-578 to S-950; K-579 to S-950; Y-580 to S-950; C-581 to S-950; 
E-582 to S-950; G-583 to S-950; K-584 to S-950; R-585 to S-950; V-586 to S-950; R-587 to S-950; 
Y-588 to S-950; R-589 to S-950; S-590 to S-950; C-591 to S-950; N-592 to S-950; L-593 to S-950; 
E-594 to S-950; D-595 to S-950; C-596 to S-950; P-597 to S-950; D-598 to S-950; N-599 to S-950; 
N-600 to S-950; G-601 to S-950; K-602 to S-950; T-603 to S-950; F-604 to S-950; R-605 to S-950; 
E-606 to S-950; E-607 to S-950; Q-608 to S-950; C-609 to S-950; E-610 to S-950; A-611 to S-950; 
H-612 to S-950; N-613 to S-950; E-614 to S-950; F-615 to S-950; S-616 to S-950; K-617 to S-950; 
A-618 to S-950; S-619 to S-950; F-620 to S-950; G-621 to S-950; S-622 to S-950; G-623 to S-950; 
P-624 to S-950; A-625 to S-950; V-626 to S-950; E-627 to S-950; W-628 to S-950; I-629 to S-950; 
P-630 to S-950; K-631 to S-950; Y-632 to S-950; A-633 to S-950; G-634 to S-950; V-635 to S-950; 
S-636 to S-950; P-637 to S-950; K-638 to S-950; D-639 to S-950; R-640 to S-950; C-641 to S-950; 
K-642 to S-950; L-643 to S-950; I-644 to S-950; C-645 to S-950; Q-646 to S-950; A-647 to S-950; 
K-648 to S-950; G-649 to S-950; I-650 to S-950; G-651 to S-950; Y-652 to S-950; F-653 to S-950; 
F-654 to S-950; V-655 to S-950; L-656 to S-950; Q-657 to S-950; P-658 to S-950; K-659 to S-950; 
V-660 to S-950; V-661 to S-950; D-662 to S-950; G-663 to S-950; T-664 to S-950; P-665 to S-950; 
C-666 to S-950; S-667 to S-950; P-668 to S-950; D-669 to S-950; S-670 to S-950; T-671 to S-950; 
S-672 to S-950; V-673 to S-950; C-674 to S-950; V-675 to S-950; Q-676 to S-950; G-677 to S-950; 
Q-678 to S-950; C-679 to S-950; V-680 to S-950; K-681 to S-950; A-682 to S-950; G-683 to S-950; 
C-684 to S-950; D-685 to S-950; R-686 to S-950; I-687 to S-950; I-688 to S-950; D-689 to S-950; 
S-690 to S-950; K-691 to S-950; K-692 to S-950; K-693 to S-950; F-694 to S-950; D-695 to S-950; 
K-696 to S-950; C-697 to S-950; G-698 to S-950; V-699 to S-950; C-700 to S-950; G-701 to S-950; 
G-702 to S-950; N-703 to S-950; G-704 to S-950; S-705 to S-950; T-706 to S-950; C-707 to S-950; 
K-708 to S-950; K-709 to S-950; I-710 to S-950; S-711 to S-950; G-712 to S-950; S-713 to S-950; 
V-714 to S-950; T-715 to S-950; S-716 to S-950; A-717 to S-950; K-718 to S-950; P-719 to S-950; 
G-720 to S-950; Y-721 to S-950; H-722 to S-950; D-723 to S-950; I-724 to S-950; I-725 to S-950; 
T-726 to S-950; I-727 to S-950; P-728 to S-950; T-729 to S-950; G-730 to S-950; 

A-731 to S-950; T-732 to S-950; N-733 to S-950; I-734 to S-950; E-735 to S-950; V-736 to S-950; 
K-737 to S-950; Q-738 to S-950; R-739 to S-950; N-740 to S-950; Q-741 to S-950; R-742 to S-950; 
G-743 to S-950; S-744 to S-950; R-745 to S-950; N-746 to S-950; N-747 to S-950; G-748 to S-950; 
S-749 to S-950; F-750 to S-950; L-751 to S-950; A-752 to S-950; I-753 to S-950; K-754 to S-950; 
A-755 to S-950; A-756 to S-950; D-757 to S-950; G-758 to S-950; T-759 to S-950; Y-760 to S-950; 
I-761 to S-950; L-762 to S-950; N-763 to S-950; G-764 to S-950; D-765 to S-950; Y-766 to S-950; 
T-767 to S-950; L-768 to S-950; S-769 to S-950; T-770 to S-950; L-771 to S-950; E-772 to S-950; 
Q-773 to S-950; D-774 to S-950; I-775 to S-950; M-776 to S-950; Y-777 to S-950; K-778 to S-950; 
G-779 to S-950; V-780 to S-950; V-781 to S-950; L-782 to S-950; R-783 to S-950; Y-784 to S-950; 
S-785 to S-950; G-786 to S-950; S-787 to S-950; S-788 to S-950; A-789 to S-950; A-790 to S-950; 
L-791 to S-950; E-792 to S-950; R-793 to S-950; I-794 to S-950; R-795 to S-950; S-796 to S-950; 
F-797 to S-950; S-798 to S-950; P-799 to S-950; L-800 to S-950; K-801 to S-950; E-802 to S-950; 
P-803 to S-950; L-804 to S-950; T-805 to S-950; I-806 to S-950; Q-807 to S-950; V-808 to S-950; 
L-809 to S-950; T-810 to S-950; V-811 to S-950; G-812 to S-950; N-813 to S-950; A-814 to S-950; 
L-815 to S-950; R-816 to S-950; P-817 to S-950; K-818 to S-950; I-819 to S-950; K-820 to S-950; 
Y-821 to S-950; T-822 to S-950; Y-823 to S-950; F-824 to S-950; V-825 to S-950; K-826 to S-950; 
K-827 to S-950; K-828 to S-950; K-829 to S-950; E-830 to S-950; S-831 to S-950; F-832 to S-950; 
N-833 to S-950; A-834 to S-950; I-835 to S-950; P-836 to S-950; T-837 to S-950; F-838 to S-950; 
S-839 to S-950; A-840 to S-950; W-841 to S-950; V-842 to S-950; I-843 to S-950; E-844 to S-950; 
E-845 to S-950; W-846 to S-950; G-847 to S-950; E-848 to S-950; C-849 to S-950; S-850 to S-950; 
K-851 to S-950; S-852 to S-950; C-853 to S-950; E-854 to S-950; L-855 to S-950; G-856 to S-950; 
W-857 to S-950; Q-858 to S-950; R-859 to S-950; R-860 to S-950; L-861 to S-950; V-862 to S-950; 
E-863 to S-950; C-864 to S-950; R-865 to S-950; D-866 to S-950; I-867 to S-950; N-868 to S-950; 
G-869 to S-950; Q-870 to S-950; P-871 to S-950; A-872 to S-950; S-873 to S-950; E-874 to S-950; 
C-875 to S-950; A-876 to S-950; K-877 to S-950; E-878 to S-950; V-879 to S-950; K-880 to S-950; 
P-881 to S-950; A-882 to S-950; S-883 to S-950; T-884 to S-950; R-885 to S-950; P-886 to S-950; 
C-887 to S-950; A-888 to S-950; D-889 to S-950; H-890 to S-950; P-891 to S-950; C-892 to S-950; 
P-893 to S-950; Q-894 to S-950; W-895 to S-950; Q-896 to S-950; L-897 to S-950; G-898 to S-950; 
E-899 to S-950; W-900 to S-950; S-901 to S-950; S-902 to S-950; C-903 to S-950; S-904 to S-950; 
K-905 to S-950; T-906 to S-950; C-907 to S-950; G-908 to S-950; K-909 to S-950; G-910 to S-950; 
Y-911 to S-950; K-912 to S-950; K-913 to S-950; R-914 to S-950; S-915 to S-950; L-916 to S-950; 
K-917 to S-950; C-918 to S-950; L-919 to S-950; S-920 to S-950; H-921 to S-950; D-922 to S-950; 
G-923 to S-950; G-924 to S-950; V-925 to S-950; L-926 to S-950; S-927 to S-950; H-928 to S-950; 
E-929 to S-950; S-930 to S-950; C-931 to S-950; D-932 to S-950; P-933 to S-950; L-934 to S-950; 
K-935 to S-950; K-936 to S-950; P-937 to S-950; K-938 to S-950; H-939 to S-950; F-940 to S-950; 
I-941 to S-950; D-942 to S-950; F-943 to S-950; C-944 to S-950; T-945 to S-950 of SEQ ID NO:2. 
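
The enumeration above follows mechanically from the general formula m-950. As an illustrative sketch only, the labels can be generated from any sequence as shown below; the function name is hypothetical and the seven-residue string is a toy stand-in, not SEQ ID NO:2.

```python
def n_terminal_deletion_labels(sequence):
    """Return labels of the form 'G-2 to S-7' for every N-terminal deletion
    m-L, where L is the sequence length and m runs from 2 to L-1."""
    last = len(sequence)
    last_label = f"{sequence[-1]}-{last}"
    return [f"{sequence[m - 1]}-{m} to {last_label}" for m in range(2, last)]


# Toy seven-residue peptide used only to show the labelling scheme.
toy_sequence = "MGNAERS"
for label in n_terminal_deletion_labels(toy_sequence):
    print(label)
```

For a 950-residue sequence ending in serine the same function would yield labels running from position 2 through position 949, each ending in "to S-950".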

Moreover, C-terminal deletions of the METH1 polypeptide can also be described by 
the general formula 1-n, where n is an integer from 2 to 950, where n corresponds to the 
position of amino acid residue identified in SEQ ID NO:2. Preferably, C-terminal deletions 
of the METH1 polypeptide of the invention shown as SEQ ID NO:2 include polypeptides 
comprising the amino acid sequence of residues: M-1 to C-949; M-1 to E-948; M-1 to A-947; 
M-1 to M-946; M-1 to T-945; M-1 to C-944; M-1 to F-943; M-1 to D-942; M-1 to I-941; 
M-1 to F-940; M-1 to H-939; M-1 to K-938; M-1 to P-937; M-1 to K-936; M-1 to K-935; 
M-1 to L-934; M-1 to P-933; M-1 to D-932; M-1 to C-931; M-1 to S-930; M-1 to E-929; 
M-1 to H-928; M-1 to S-927; M-1 to L-926; M-1 to V-925; M-1 to G-924; M-1 to G-923; 
M-1 to D-922; M-1 to H-921; M-1 to S-920; M-1 to L-919; M-1 to C-918; M-1 to K-917; 
M-1 to L-916; M-1 to S-915; M-1 to R-914; M-1 to K-913; M-1 to K-912; M-1 to Y-911; 
M-1 to G-910; M-1 to K-909; M-1 to G-908; M-1 to C-907; M-1 to T-906; M-1 to K-905; 
M-1 to S-904; M-1 to C-903; M-1 to S-902; M-1 to S-901; M-1 to W-900; M-1 to E-899; 
M-1 to G-898; M-1 to L-897; M-1 to Q-896; M-1 to W-895; M-1 to Q-894; M-1 to P-893; 
M-1 to C-892; M-1 to P-891; M-1 to H-890; M-1 to D-889; M-1 to A-888; M-1 to C-887; 
M-1 to P-886; M-1 to R-885; M-1 to T-884; M-1 to S-883; M-1 to A-882; M-1 to P-881; 
M-1 to K-880; M-1 to V-879; M-1 to E-878; M-1 to K-877; M-1 to A-876; M-1 to C-875; 
M-1 to E-874; M-1 to S-873; M-1 to A-872; M-1 to P-871; M-1 to Q-870; M-1 to G-869; 
M-1 to N-868; M-1 to I-867; M-1 to D-866; M-1 to R-865; M-1 to C-864; M-1 to E-863; 
M-1 to V-862; M-1 to L-861; M-1 to R-860; M-1 to R-859; M-1 to Q-858; M-1 to W-857; 
M-1 to G-856; M-1 to L-855; M-1 to E-854; M-1 to C-853; M-1 to S-852; M-1 to K-851; 
M-1 to S-850; M-1 to C-849; M-1 to E-848; M-1 to G-847; M-1 to W-846; M-1 to E-845; 
M-1 to E-844; M-1 to I-843; M-1 to V-842; M-1 to W-841; M-1 to A-840; M-1 to S-839; 
M-1 to F-838; M-1 to T-837; M-1 to P-836; M-1 to I-835; M-1 to A-834; M-1 to N-833; 
M-1 to F-832; M-1 to S-831; M-1 to E-830; M-1 to K-829; M-1 to K-828; M-1 to K-827; 
M-1 to K-826; M-1 to V-825; M-1 to F-824; M-1 to Y-823; M-1 to T-822; M-1 to Y-821; 
M-1 to K-820; M-1 to I-819; M-1 to K-818; M-1 to P-817; M-1 to R-816; M-1 to L-815; 
M-1 to A-814; M-1 to N-813; M-1 to G-812; M-1 to V-811; M-1 to T-810; M-1 to L-809; 
M-1 to V-808; M-1 to Q-807; M-1 to I-806; M-1 to T-805; M-1 to L-804; M-1 to P-803; 
M-1 to E-802; M-1 to K-801; M-1 to L-800; M-1 to P-799; M-1 to S-798; M-1 to F-797; 
M-1 to S-796; M-1 to R-795; M-1 to I-794; M-1 to R-793; M-1 to E-792; M-1 to L-791; 
M-1 to A-790; M-1 to A-789; M-1 to S-788; M-1 to S-787; M-1 to G-786; M-1 to S-785; 
M-1 to Y-784; M-1 to R-783; M-1 to L-782; M-1 to V-781; M-1 to V-780; M-1 to G-779; 
M-1 to K-778; M-1 to Y-777; M-1 to M-776; M-1 to I-775; M-1 to D-774; M-1 to Q-773; 
M-1 to E-772; M-1 to L-771; M-1 to T-770; M-1 to S-769; M-1 to L-768; M-1 to T-767; 
M-1 to Y-766; M-1 to D-765; M-1 to G-764; M-1 to N-763; M-1 to L-762; M-1 to I-761; 
M-1 to Y-760; M-1 to T-759; M-1 to G-758; M-1 to D-757; M-1 to A-756; M-1 to A-755; 
M-1 to K-754; M-1 to I-753; M-1 to A-752; M-1 to L-751; M-1 to F-750; M-1 to S-749; 
M-1 to G-748; M-1 to N-747; M-1 to N-746; M-1 to R-745; M-1 to S-744; M-1 to G-743; 
M-1 to R-742; M-1 to Q-741; M-1 to N-740; M-1 to R-739; M-1 to Q-738; M-1 to K-737; 
M-1 to V-736; M-1 to E-735; M-1 to I-734; M-1 to N-733; M-1 to T-732; M-1 to A-731; 
M-1 to G-730; M-1 to T-729; M-1 to P-728; M-1 to I-727; M-1 to T-726; M-1 to I-725; 
M-1 to I-724; M-1 to D-723; M-1 to H-722; M-1 to Y-721; M-1 to G-720; M-1 to P-719; 
M-1 to K-718; M-1 to A-717; M-1 to S-716; M-1 to T-715; M-1 to V-714; M-1 to S-713; 
M-1 to G-712; M-1 to S-711; M-1 to I-710; M-1 to K-709; M-1 to K-708; M-1 to C-707; 
M-1 to T-706; M-1 to S-705; M-1 to G-704; M-1 to N-703; M-1 to G-702; M-1 to G-701; 
M-1 to C-700; M-1 to V-699; M-1 to G-698; M-1 to C-697; M-1 to K-696; M-1 to D-695; 
M-1 to F-694; M-1 to K-693; M-1 to K-692; M-1 to K-691; M-1 to S-690; M-1 to D-689; 
M-1 to I-688; M-1 to I-687; M-1 to R-686; M-1 to D-685; M-1 to C-684; M-1 to G-683;
M-1 to A-682; M-1 to K-681; M-1 to V-680; M-1 to C-679; M-1 to Q-678; M-1 to G-677;
M-1 to Q-676; M-1 to V-675; M-1 to C-674; M-1 to V-673; M-1 to S-672; M-1 to T-671;
M-1 to S-670; M-1 to D-669; M-1 to P-668; M-1 to S-667; M-1 to C-666; M-1 to P-665;
M-1 to T-664; M-1 to G-663; M-1 to D-662; M-1 to V-661; M-1 to V-660; M-1 to K-659;
M-1 to P-658; M-1 to Q-657; M-1 to L-656; M-1 to V-655; M-1 to F-654; M-1 to F-653;
M-1 to Y-652; M-1 to G-651; M-1 to I-650; M-1 to G-649; M-1 to K-648; M-1 to A-647;
M-1 to Q-646; M-1 to C-645; M-1 to I-644; M-1 to L-643; M-1 to K-642; M-1 to C-641;
M-1 to R-640; M-1 to D-639; M-1 to K-638; M-1 to P-637; M-1 to S-636; M-1 to V-635;
M-1 to G-634; M-1 to A-633; M-1 to Y-632; M-1 to K-631; M-1 to P-630; M-1 to I-629;
M-1 to W-628; M-1 to E-627; M-1 to V-626; M-1 to A-625; M-1 to P-624; M-1 to G-623;
M-1 to S-622; M-1 to G-621; M-1 to F-620; M-1 to S-619; M-1 to A-618; M-1 to K-617;
M-1 to S-616; M-1 to F-615; M-1 to E-614; M-1 to N-613; M-1 to H-612; M-1 to A-611;
M-1 to E-610; M-1 to C-609; M-1 to Q-608; M-1 to E-607; M-1 to E-606; M-1 to R-605;
M-1 to F-604; M-1 to T-603; M-1 to K-602; M-1 to G-601; M-1 to N-600; M-1 to N-599;
M-1 to D-598; M-1 to P-597; M-1 to C-596; M-1 to D-595; M-1 to E-594; M-1 to L-593;
M-1 to N-592; M-1 to C-591; M-1 to S-590; M-1 to R-589; M-1 to Y-588; M-1 to R-587;
M-1 to V-586; M-1 to R-585; M-1 to K-584; M-1 to G-583; M-1 to E-582; M-1 to C-581;
M-1 to Y-580; M-1 to K-579; M-1 to G-578; M-1 to G-577; M-1 to N-576; M-1 to K-575;
M-1 to P-574; M-1 to V-573; M-1 to P-572; M-1 to N-571; M-1 to D-570; M-1 to C-569;
M-1 to E-568; M-1 to R-567; M-1 to M-566; M-1 to T-565; M-1 to Y-564; M-1 to Q-563;
M-1 to V-562; M-1 to G-561; M-1 to G-560; M-1 to G-559; M-1 to C-558; M-1 to T-557;
M-1 to R-556; M-1 to S-555; M-1 to C-554; M-1 to D-553; M-1 to G-552; M-1 to W-551;
M-1 to P-550; M-1 to G-549; M-1 to W-548; M-1 to M-547; M-1 to G-546; M-1 to W-545;
M-1 to S-544; M-1 to G-543; M-1 to H-542; M-1 to F-541; M-1 to P-540; M-1 to T-539;
M-1 to D-538; M-1 to F-537; M-1 to H-536; M-1 to K-535; M-1 to R-534; M-1 to D-533;
M-1 to T-532; M-1 to K-531; M-1 to N-530; M-1 to V-529; M-1 to C-528; M-1 to K-527;
M-1 to G-526; M-1 to N-525; M-1 to I-524; M-1 to C-523; M-1 to W-522; M-1 to K-521;
M-1 to G-520; M-1 to E-519; M-1 to G-518; M-1 to C-517; M-1 to S-516; M-1 to T-515;
M-1 to G-514; M-1 to D-513; M-1 to A-512; M-1 to W-511; M-1 to P-510; M-1 to F-509;
M-1 to H-508; M-1 to K-507; M-1 to T-506; M-1 to Q-505; M-1 to C-504; M-1 to V-503;
M-1 to L-502; M-1 to V-501; M-1 to G-500; M-1 to G-499; M-1 to S-498; M-1 to T-497; M-1
to G-496; M-1 to T-495; M-1 to C-494; M-1 to W-493; M-1 to L-492; M-1 to T-491; M-1 
to S-490; M-1 to C-489; M-1 to T-488; M-1 to S-487; M-1 to A-486; M-1 to A-485; M-1 
to D-484; M-1 to P-483; M-1 to C-482; M-1 to H-481; M-1 to K-480; M-1 to S-479; M-1 
to D-478; M-1 to E-477; M-1 to G-476; M-1 to F-475; M-1 to T-474; M-1 to F-473; M-1 
to Q-472; M-1 to C-471; M-1 to Q-470; M-1 to R-469; M-1 to N-468; M-1 to A-467; M-1 
to D-466; M-1 to Y-465; M-1 to S-464; M-1 to T-463; M-1 to G-462; M-1 to P-461; M-1 
to L-460; M-1 to D-459; M-1 to G-458; M-1 to P-457; M-1 to L-456; M-1 to Q-455; M-1 
to I-454; M-1 to P-453; M-1 to N-452; M-1 to Q-451; M-1 to P-450; M-1 to K-449; M-1 to
D-448; M-1 to M-447; M-1 to L-446; M-1 to C-445; M-1 to E-444; M-1 to G-443; M-1 to 
H-442; M-1 to G-441; M-1 to N-440; M-1 to D-439; M-1 to L-438; M-1 to F-437; M-1 to 
S-436; M-1 to T-435; M-1 to I-434; M-1 to M-433; M-1 to Y-432; M-1 to A-431; M-1 to
S-430; M-1 to C-429; M-1 to P-428; M-1 to S-427; M-1 to W-426; M-1 to P-425; M-1 to 
Q-424; M-1 to S-423; M-1 to H-422; M-1 to D-421; M-1 to L-420; M-1 to N-419; M-1 to 
S-418; M-1 to L-417; M-1 to M-416; M-1 to S-415; M-1 to A-414; M-1 to M-413; M-1 to
M-412; M-1 to H-411; M-1 to S-410; M-1 to D-409; M-1 to Q-408; M-1 to N-407; M-1 to
V-406; M-1 to G-405; M-1 to N-404; M-1 to L-403; M-1 to S-402; M-1 to A-401; M-1 to 
C-400; M-1 to Q-399; M-1 to K-398; M-1 to A-397; M-1 to D-396; M-1 to D-395; M-1 to 
H-394; M-1 to P-393; M-1 to M-392; M-1 to N-391; M-1 to F-390; M-1 to V-389; M-1 to 
H-388; M-1 to G-387; M-1 to L-386; M-1 to E-385; M-1 to H-384; M-1 to A-383; M-1 to 
T-382; M-1 to T-381; M-1 to F-380; M-1 to A-379; M-1 to A-378; M-1 to Q-377; M-1 to 
L-376; M-1 to G-375; M-1 to D-374; M-1 to D-373; M-1 to E-372; M-1 to I-371; M-1 to
V-370; M-1 to S-369; M-1 to C-368; M-1 to S-367; M-1 to R-366; M-1 to S-365; M-1 to 
P-364; M-1 to D-363; M-1 to C-362; M-1 to V-361; M-1 to T-360; M-1 to G-359; M-1 to 
V-358; M-1 to D-357; M-1 to A-356; M-1 to M-355; M-1 to G-354; M-1 to L-353; M-1 to 
T-352; M-1 to D-351; M-1 to C-350; M-1 to T-349; M-1 to Q-348; M-1 to S-347; M-1 to 
G-346; M-1 to C-345; M-1 to L-344; M-1 to D-343; M-1 to Q-342; M-1 to R-341; M-1 to 
T-340; M-1 to F-339; M-1 to L-338; M-1 to I-337; M-1 to A-336; M-1 to T-335;
M-1 to D-334; M-1 to Y-333; M-1 to H-332; M-1 to E-331; M-1 to A-330; M-1 to D-329;
M-1 to R-328; M-1 to D-327; M-1 to S-326; M-1 to P-325; M-1 to P-324; M-1 to N-323;
M-1 to H-322; M-1 to Q-321; M-1 to K-320; M-1 to Q-319; M-1 to W-318; M-1 to N-317;
M-1 to C-316; M-1 to F-315; M-1 to N-314; M-1 to R-313; M-1 to L-312; M-1 to T-311;
M-1 to L-310; M-1 to A-309; M-1 to A-308; M-1 to N-307; M-1 to S-306; M-1 to T-305;
M-1 to V-304; M-1 to E-303; M-1 to P-302; M-1 to G-301; M-1 to K-300; M-1 to Q-299;
M-1 to E-298; M-1 to D-297; M-1 to H-296; M-1 to I-295; M-1 to V-294; M-1 to L-293;
M-1 to I-292; M-1 to K-291; M-1 to V-290; M-1 to V-289; M-1 to V-288; M-1 to L-287;
M-1 to S-286; M-1 to V-285; M-1 to S-284; M-1 to N-283; M-1 to R-282; M-1 to I-281;
M-1 to S-280; M-1 to P-279; M-1 to H-278; M-1 to K-277; M-1 to Y-276; M-1 to L-275;
M-1 to R-274; M-1 to A-273; M-1 to A-272; M-1 to V-271; M-1 to S-270; M-1 to F-269;
M-1 to L-268; M-1 to T-267; M-1 to L-266; M-1 to L-265; M-1 to Y-264; M-1 to H-263;
M-1 to K-262; M-1 to L-261; M-1 to G-260; M-1 to S-259; M-1 to G-258; M-1 to H-257;
M-1 to F-256; M-1 to E-255; M-1 to A-254; M-1 to M-253; M-1 to S-252; M-1 to Q-251;
M-1 to D-250; M-1 to A-249; M-1 to V-248; M-1 to L-247; M-1 to M-246; M-1 to T-245;
M-1 to E-244; M-1 to V-243; M-1 to Y-242; M-1 to R-241; M-1 to H-240; M-1 to S-239;
M-1 to S-238; M-1 to V-237; M-1 to F-236; M-1 to R-235; M-1 to K-234; M-1 to K-233;
M-1 to R-232; M-1 to I-231; M-1 to S-230; M-1 to G-229; M-1 to T-228; M-1 to G-227;
M-1 to T-226; M-1 to P-225; M-1 to Q-224; M-1 to G-223; M-1 to V-222; M-1 to G-221;
M-1 to Q-220; M-1 to L-219; M-1 to A-218; M-1 to P-217; M-1 to D-216; M-1 to Q-215;
M-1 to P-214; M-1 to S-213; M-1 to W-212; M-1 to Q-211; M-1 to P-210; M-1 to G-209;
M-1 to E-208; M-1 to D-207; M-1 to E-206; M-1 to G-205; M-1 to E-204; M-1 to T-203;
M-1 to G-202; M-1 to E-201; M-1 to D-200; M-1 to E-199; M-1 to D-198; M-1 to E-197;
M-1 to T-196; M-1 to E-195; M-1 to A-194; M-1 to K-193; M-1 to G-192; M-1 to T-191;
M-1 to P-190; M-1 to R-189; M-1 to P-188; M-1 to E-187; M-1 to D-186; M-1 to D-185;
M-1 to V-184; M-1 to V-183; M-1 to G-182; M-1 to C-181; M-1 to T-180; M-1 to G-179;
M-1 to G-178; M-1 to V-177; M-1 to D-176; M-1 to G-175; M-1 to Q-174; M-1 to R-173;
M-1 to N-172; M-1 to R-171; M-1 to R-170; M-1 to L-169; M-1 to L-168; M-1 to H-167;
M-1 to F-166; M-1 to Q-165; M-1 to L-164; M-1 to P-163; M-1 to A-162; M-1 to P-161;
M-1 to P-160; M-1 to K-159; M-1 to E-158; M-1 to G-157; M-1 to P-156; M-1 to A-155;
M-1 to A-154; M-1 to T-153; M-1 to A-152; M-1 to L-151; M-1 to R-150; M-1 to E-149;
M-1 to S-148; M-1 to A-147; M-1 to A-146; M-1 to P-145; M-1 to L-144; M-1 to P-143;
M-1 to Q-142; M-1 to I-141; M-1 to F-140; M-1 to Y-139; M-1 to A-138; M-1 to E-137;
M-1 to G-136; M-1 to L-135; M-1 to L-134; M-1 to Y-133; M-1 to F-132; M-1 to A-131;
M-1 to G-130; M-1 to R-129; M-1 to V-128; M-1 to G-127; M-1 to E-126; M-1 to C-125;
M-1 to L-124; M-1 to S-123; M-1 to L-122; M-1 to A-121; M-1 to A-120; M-1 to A-119;
M-1 to S-118; M-1 to S-117; M-1 to P-116; M-1 to D-115; M-1 to G-114; M-1 to N-113;
M-1 to V-112; M-1 to T-111; M-1 to G-110; M-1 to S-109; M-1 to Y-108; M-1 to F-107;
M-1 to C-106; M-1 to H-105; M-1 to A-104; M-1 to L-103; M-1 to D-102; M-1 to T-101;
M-1 to E-100; M-1 to P-99; M-1 to L-98; M-1 to P-97; M-1 to T-96; M-1 to E-95;
M-1 to S-94; M-1
to G-93; M-1 to S-92; M-1 to K-91; M-1 to R-90; M-1 to G-89; M-1 to V-88; M-1 to N-87;
M-1 to Q-86; M-1 to L-85; M-1 to T-84; M-1 to F-83; M-1 to G-82; M-1 to P-81; M-1 to A-80;
M-1 to L-79; M-1 to F-78; M-1 to S-77; M-1 to S-76; M-1 to D-75; M-1 to P-74; M-1 to R-73;
M-1 to L-72; M-1 to E-71; M-1 to L-70; M-1 to D-69; M-1 to L-68; M-1 to Q-67; M-1 to Q-66;
M-1 to D-65; M-1 to F-64; M-1 to A-63; M-1 to H-62; M-1 to L-61; M-1 to R-60; M-1 to L-59;
M-1 to R-58; M-1 to T-57; M-1 to T-56; M-1 to G-55; M-1 to H-54; M-1 to G-53; M-1 to P-52;
M-1 to A-51; M-1 to R-50; M-1 to E-49; M-1 to L-48; M-1 to E-47; M-1 to P-46; M-1 to V-45;
M-1 to V-44; M-1 to L-43; M-1 to E-42; M-1 to E-41; M-1 to D-40; M-1 to E-39; M-1 to E-38;
M-1 to S-37; M-1 to P-36; M-1 to R-35; M-1 to G-34; M-1 to L-33; M-1 to A-32; M-1 to D-31;
M-1 to S-30; M-1 to V-29; M-1 to A-28; M-1 to L-27; M-1 to L-26; M-1 to A-25; M-1 to A-24;
M-1 to A-23; M-1 to L-22; M-1 to L-21; M-1 to L-20; M-1 to L-19; M-1 to T-18; M-1 to P-17;
M-1 to V-16; M-1 to P-15; M-1 to G-14; M-1 to F-13; M-1 to S-12; M-1 to R-11; M-1 to S-10;
M-1 to G-9; M-1 to P-8; M-1 to A-7; of SEQ ID NO:2. For example, any of the above listed
N- or C-terminal deletions can be combined to produce an N- and C-terminal deleted METH1
polypeptide. Particularly preferred fragments of SEQ ID NO:2 are H542-Q894 and K801-S950.
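
The combination of an N-terminal and a C-terminal deletion described above can be sketched in code. The following Python fragment is illustrative only and is not part of the disclosed methods: the helper names parse_range and combined_deletion, and the ten-residue peptide, are hypothetical placeholders and do not represent SEQ ID NO:2. It assumes 1-based residue numbering, so that a fragment running from residue m to residue n corresponds to the slice seq[m-1:n].

```python
import re

# Matches range labels such as "H542-Q894" or "M-1 to A-7".
_RANGE = re.compile(r"([A-Z])-?(\d+)\s*(?:-|to)\s*([A-Z])-?(\d+)")


def parse_range(label):
    """Split a range label into (first residue, m, last residue, n)."""
    match = _RANGE.fullmatch(label.strip())
    if match is None:
        raise ValueError(f"unrecognized range label: {label!r}")
    first, m, last, n = match.groups()
    return first, int(m), last, int(n)


def combined_deletion(seq, m, n):
    """Return (label, fragment) for the polypeptide spanning residues m..n.

    Residues 1..m-1 are the N-terminal deletion and residues n+1..len(seq)
    are the C-terminal deletion (1-based numbering).
    """
    if not 1 <= m <= n <= len(seq):
        raise ValueError("m and n must satisfy 1 <= m <= n <= len(seq)")
    label = f"{seq[m - 1]}-{m} to {seq[n - 1]}-{n}"
    return label, seq[m - 1:n]


toy = "ACDEFGHIKL"  # hypothetical peptide, for illustration only
print(combined_deletion(toy, 3, 8))
print(parse_range("H542-Q894"))
```

Applied to an actual sequence, parse_range could be used to check that the residue letters in a label such as H542-Q894 agree with the residues found at those positions before the fragment is extracted.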

Deletion mutants of METH1 may also be made which comprise all or part of the
additional sequence described in SEQ ID NO:125. For example, exemplary deletion
mutants include: Q-2 to S-967; R-3 to S-967; A-4 to S-967; V-5 to S-967; P-6 to S-967;
E-7 to S-967; G-8 to S-967; F-9 to S-967; G-10 to S-967; R-11 to S-967; R-12 to S-967;
K-13 to S-967; L-14 to S-967; G-15 to S-967; S-16 to S-967; D-17 to S-967; and M-18 to S-967.

Moreover, N-terminal deletions of the METH2 polypeptide can be described by the
general formula m2-890, where m2 is an integer from 2 to 889, where m2 corresponds to the
position of the amino acid residue identified in SEQ ID NO:4. Preferably, N-terminal
deletions of the METH2 polypeptide of the invention shown as SEQ ID NO:4 include
polypeptides comprising the amino acid sequence of residues:
F-2 to L-890; P-3 to L-890; A-4 to L-890; P-5 to L-890; A-6 to L-890; A-7 to L-890;
P-8 to L-890; R-9 to L-890; W-10 to L-890; L-11 to L-890; P-12 to L-890; F-13 to L-890;
L-14 to L-890; L-15 to L-890; L-16 to L-890; L-17 to L-890; L-18 to L-890; L-19 to L-890;
L-20 to L-890; L-21 to L-890; L-22 to L-890; P-23 to L-890; L-24 to L-890; A-25 to L-890;
R-26 to L-890; G-27 to L-890; A-28 to L-890; P-29 to L-890; A-30 to L-890; R-31 to L-890;
P-32 to L-890; A-33 to L-890; A-34 to L-890; G-35 to L-890; G-36 to L-890; Q-37 to L-890;
A-38 to L-890; S-39 to L-890; E-40 to L-890; L-41 to L-890; V-42 to L-890; V-43 to L-890;
P-44 to L-890; T-45 to L-890; R-46 to L-890; L-47 to L-890; P-48 to L-890; G-49 to L-890;
S-50 to L-890; A-51 to L-890; G-52 to L-890; E-53 to L-890; L-54 to L-890; A-55 to L-890;
L-56 to L-890; H-57 to L-890; L-58 to L-890; S-59 to L-890; A-60 to L-890; F-61 to L-890;
G-62 to L-890; K-63 to L-890; G-64 to L-890; F-65 to L-890; V-66 to L-890; L-67 to L-890;
R-68 to L-890; L-69 to L-890; A-70 to L-890; P-71 to L-890; D-72 to L-890; D-73 to L-890;
S-74 to L-890; F-75 to L-890; L-76
to L-890; A-77 to L-890; P-78 to L-890; E-79 to L-890; F-80 to L-890; K-81 to L-890; I-82 to L-890;
E-83 to L-890; R-84 to L-890; L-85 to L-890; G-86 to L-890; G-87 to L-890; S-88 to L-890;
G-89 to L-890; R-90 to L-890; A-91 to L-890; T-92 to L-890; G-93 to L-890; G-94 to L-890;
E-95 to L-890; R-96 to L-890; G-97 to L-890; L-98 to L-890; R-99 to L-890; G-100 to L-890;
C-101 to L-890; F-102 to L-890; F-103 to L-890; S-104 to L-890; G-105 to L-890; T-106 to L-890;
V-107 to L-890; N-108 to L-890; G-109 to L-890; E-110 to L-890; P-111 to L-890; E-112 to L-890;
S-113 to L-890; L-114 to L-890; A-115 to L-890; A-116 to L-890; V-117 to L-890; S-118 to L-890;
L-119 to L-890; C-120 to L-890; R-121 to L-890; G-122 to L-890; L-123 to L-890; S-124 to L-890;
G-125 to L-890; S-126 to L-890; F-127 to L-890; L-128 to L-890; L-129 to L-890; D-130 to L-890;
G-131 to L-890; E-132 to L-890; E-133 to L-890; F-134 to L-890; T-135 to L-890; I-136 to L-890;
Q-137 to L-890; P-138 to L-890; Q-139 to L-890; G-140 to L-890; A-141 to L-890; G-142 to L-890;
G-143 to L-890; S-144 to L-890; L-145 to L-890; A-146 to L-890; Q-147 to L-890; P-148 to L-890;
H-149 to L-890; R-150 to L-890; L-151 to L-890; Q-152 to L-890; R-153 to L-890; W-154 to L-890;
G-155 to L-890; P-156 to L-890; A-157 to L-890; G-158 to L-890; A-159 to L-890; R-160 to L-890;
P-161 to L-890; L-162 to L-890; P-163 to L-890; R-164 to L-890; G-165 to L-890; P-166 to L-890;
E-167 to L-890; W-168 to L-890; E-169 to L-890; V-170 to L-890; E-171 to L-890; T-172 to L-890;
G-173 to L-890; E-174 to L-890; G-175 to L-890; Q-176 to L-890; R-177 to L-890; Q-178 to L-890;
E-179 to L-890; R-180 to L-890; G-181
to L-890; D-182 to L-890; H-183 to L-890; Q-184 to L-890; E-185 to L-890; D-186 to L-890; S-187 to L-890;
E-188 to L-890; E-189 to L-890; E-190 to L-890; S-191 to L-890; Q-192 to L-890; E-193 to L-890;
E-194 to L-890; E-195 to L-890; A-196 to L-890; E-197 to L-890; G-198 to L-890; A-199 to L-890;
S-200 to L-890; E-201 to L-890; P-202 to L-890; P-203 to L-890; P-204 to L-890; P-205 to L-890;
L-206 to L-890; G-207 to L-890; A-208 to L-890; T-209 to L-890; S-210 to L-890; R-211 to L-890;
T-212 to L-890; K-213 to L-890; R-214 to L-890; F-215 to L-890; V-216 to L-890; S-217 to L-890;
E-218 to L-890; A-219 to L-890; R-220 to L-890; F-221 to L-890; V-222 to L-890; E-223 to L-890;
T-224 to L-890; L-225 to L-890; L-226 to L-890; V-227 to L-890; A-228 to L-890; D-229 to L-890;
A-230 to L-890; S-231 to L-890; M-232 to L-890; A-233 to L-890; A-234 to L-890; F-235 to L-890;
Y-236 to L-890; G-237 to L-890; A-238 to L-890; D-239 to L-890; L-240 to L-890; Q-241 to L-890;
N-242 to L-890; H-243 to L-890; I-244 to L-890; L-245 to L-890; T-246 to L-890; L-247 to L-890;
M-248 to L-890; S-249 to L-890; V-250 to L-890; A-251 to L-890; A-252 to L-890; R-253 to L-890;
I-254 to L-890; Y-255 to L-890; K-256 to L-890; H-257 to L-890; P-258 to L-890; S-259 to L-890;
I-260 to L-890; K-261 to L-890; N-262 to L-890; S-263 to L-890; I-264 to L-890; N-265 to L-890;
L-266 to L-890; M-267 to L-890; V-268 to L-890; V-269 to L-890; K-270 to L-890; V-271 to L-890;
L-272 to L-890; I-273 to L-890; V-274 to L-890; E-275 to L-890; D-276 to L-890; E-277 to L-890;
K-278 to L-890; W-279 to L-890; G-280 to L-890; P-281 to L-890; E-282 to L-890; V-283 to L-890;
S-284 to L-890; D-285 to L-890; N-286 to L-890; G-287 to L-890; G-288 to L-890; L-289 to L-890;
T-290 to L-890; L-291 to L-890; R-292 to L-890; N-293 to L-890; F-294 to L-890; C-295 to L-890;
N-296 to L-890; W-297 to L-890; Q-298 to L-890; R-299 to L-890; R-300 to L-890; F-301 to L-890;
N-302 to L-890; Q-303 to L-890; P-304 to L-890; S-305 to L-890; D-306 to L-890; R-307 to L-890;
H-308 to L-890; P-309 to L-890; E-310 to L-890; H-311 to L-890; Y-312 to L-890; D-313 to L-890;
T-314 to L-890; A-315 to L-890; I-316 to L-890; L-317 to L-890; L-318 to L-890; T-319 to L-890;
R-320 to L-890; Q-321 to L-890; N-322 to L-890; F-323 to L-890; C-324 to L-890; G-325 to L-890;
Q-326 to L-890; E-327 to L-890; G-328 to L-890; L-329 to L-890; C-330 to L-890; D-331 to L-890;
T-332 to L-890; L-333 to L-890; G-334 to L-890; V-335 to L-890; A-336 to L-890; D-337 to L-890;
I-338 to L-890; G-339 to L-890; T-340 to L-890; I-341 to L-890; C-342 to L-890; D-343 to L-890;
P-344 to L-890; N-345 to L-890; K-346 to L-890; S-347 to L-890; C-348 to L-890; S-349 to L-890;
V-350 to L-890; I-351 to L-890; E-352 to L-890; D-353 to L-890; E-354 to L-890; G-355 to L-890;
L-356 to L-890; Q-357 to L-890; A-358 to L-890; A-359 to L-890; H-360
to L-890; T-361 to L-890; L-362 to L-890; A-363 to L-890; H-364 to L-890; E-365 to L-890; L-366 to L-890;
G-367 to L-890; H-368 to L-890; V-369 to L-890; L-370 to L-890; S-371 to L-890; M-372 to L-890;
P-373 to L-890; H-374 to L-890; D-375 to L-890; D-376 to L-890; S-377 to L-890; K-378 to L-890;
P-379 to L-890; C-380 to L-890; T-381 to L-890; R-382 to L-890; L-383 to L-890; F-384 to L-890;
G-385 to L-890; P-386 to L-890; M-387 to L-890; G-388 to L-890; K-389 to L-890; H-390 to L-890;
H-391 to L-890; V-392 to L-890; M-393 to L-890; A-394 to L-890; P-395 to L-890; L-396 to L-890;
F-397 to L-890; V-398 to L-890; H-399 to L-890; L-400 to L-890; N-401 to L-890; Q-402 to L-890;
T-403 to L-890; L-404 to L-890; P-405 to L-890; W-406 to L-890; S-407 to L-890; P-408 to L-890;
C-409 to L-890; S-410 to L-890; A-411 to L-890; M-412 to L-890; Y-413 to L-890; L-414 to L-890;
T-415 to L-890; E-416 to L-890; L-417 to L-890; L-418 to L-890; D-419 to L-890; G-420 to L-890;
G-421 to L-890; H-422 to L-890; G-423 to L-890; D-424 to L-890; C-425 to L-890; L-426 to L-890;
L-427 to L-890; D-428 to L-890; A-429 to L-890; P-430 to L-890; G-431 to L-890; A-432 to L-890;
A-433 to L-890; L-434 to L-890; P-435 to L-890; L-436 to L-890; P-437 to L-890; T-438 to L-890;
G-439 to L-890; L-440 to L-890; P-441 to L-890; G-442 to L-890; R-443 to L-890; M-444 to L-890;
A-445 to L-890; L-446 to L-890; Y-447 to L-890; Q-448 to L-890; L-449 to L-890; D-450 to L-890;
Q-451 to L-890; Q-452 to L-890; C-453 to L-890; R-454 to L-890; Q-455 to L-890; I-456 to L-890;
F-457 to L-890; G-458 to L-890; P-459 to L-890; D-460 to L-890; F-461 to L-890; R-462 to L-890;
H-463 to L-890; C-464 to L-890; P-465 to L-890; N-466 to L-890; T-467 to L-890; S-468 to L-890;
A-469 to L-890; Q-470 to L-890; D-471 to L-890; V-472 to L-890; C-473 to L-890; A-474 to L-890;
Q-475 to L-890; L-476 to L-890; W-477 to L-890; C-478 to L-890; H-479 to L-890; T-480 to L-890;
D-481 to L-890; G-482 to L-890; A-483 to L-890; E-484 to L-890; P-485 to L-890; L-486 to L-890;
C-487 to L-890; H-488 to L-890; T-489 to L-890; K-490 to L-890; N-491 to L-890; G-492 to L-890;
S-493 to L-890; L-494 to L-890; P-495 to L-890; W-496 to L-890; A-497 to L-890; D-498 to L-890;
G-499 to L-890; T-500 to L-890; P-501 to L-890; C-502 to L-890; G-503 to L-890; P-504 to L-890;
G-505 to L-890; H-506 to L-890; L-507 to L-890; C-508 to L-890; S-509 to L-890; E-510 to L-890;
G-511 to L-890; S-512 to L-890; C-513 to L-890; L-514 to L-890; P-515 to L-890; E-516 to L-890;
E-517 to L-890; E-518 to L-890; V-519 to L-890; E-520 to L-890; R-521 to L-890; P-522 to L-890;
K-523 to L-890; P-524 to L-890; V-525 to L-890; V-526 to L-890; D-527 to L-890; G-528 to L-890;
G-529 to L-890; W-530 to L-890; A-531 to L-890; P-532 to L-890; W-533 to L-890; G-534 to L-890;
P-535 to L-890; W-536 to L-890; G-537 to L-890; E-538 to L-890;
C-539 to L-890; S-540 to L-890; R-541 to L-890; T-542 to L-890; C-543 to L-890; G-544 to L-890;
G-545 to L-890; G-546 to L-890; V-547 to L-890; Q-548 to L-890; F-549 to L-890; S-550 to L-890;
H-551 to L-890; R-552 to L-890; E-553 to L-890; C-554 to L-890; K-555 to L-890; D-556 to L-890;
P-557 to L-890; E-558 to L-890; P-559 to L-890; Q-560 to L-890; N-561 to L-890; G-562 to L-890;
G-563 to L-890; R-564 to L-890; Y-565 to L-890; C-566 to L-890; L-567 to L-890; G-568 to L-890;
R-569 to L-890; R-570 to L-890; A-571 to L-890; K-572 to L-890; Y-573 to L-890; Q-574 to L-890;
S-575 to L-890; C-576 to L-890; H-577 to L-890; T-578 to L-890; E-579 to L-890; E-580 to L-890;
C-581 to L-890; P-582 to L-890; P-583 to L-890; D-584 to L-890; G-585 to L-890; K-586 to L-890;
S-587 to L-890; F-588 to L-890; R-589 to L-890; E-590 to L-890; Q-591 to L-890; Q-592 to L-890;
C-593 to L-890; E-594 to L-890; K-595 to L-890; Y-596 to L-890; N-597 to L-890; A-598 to L-890;
Y-599 to L-890; N-600 to L-890; Y-601 to L-890; T-602 to L-890; D-603 to L-890; M-604 to L-890;
D-605 to L-890; G-606 to L-890; N-607 to L-890; L-608 to L-890; L-609 to L-890; Q-610 to L-890;
W-611 to L-890; V-612 to L-890; P-613 to L-890; K-614 to L-890; Y-615 to L-890; A-616 to L-890;
G-617 to L-890; V-618 to L-890; S-619 to L-890; P-620 to L-890; R-621 to L-890; D-622 to L-890;
R-623 to L-890; C-624 to L-890; K-625 to L-890; L-626 to L-890; F-627 to L-890; C-628 to L-890;
R-629 to L-890; A-630 to L-890; R-631 to L-890; G-632 to L-890; R-633 to L-890; S-634 to L-890;
E-635 to L-890; F-636 to L-890; K-637 to L-890; V-638 to L-890; F-639 to L-890; E-640 to L-890;
A-641 to L-890; K-642 to L-890; V-643 to L-890; I-644 to L-890; D-645 to L-890; G-646 to L-890;
T-647 to L-890; L-648 to L-890; C-649 to L-890; G-650 to L-890; P-651 to L-890; E-652 to L-890;
T-653 to L-890; L-654 to L-890; A-655 to L-890; I-656 to L-890; C-657 to L-890; V-658 to L-890;
R-659 to L-890; G-660 to L-890; Q-661 to L-890; C-662 to L-890; V-663 to L-890; K-664 to L-890;
A-665 to L-890; G-666 to L-890; C-667 to L-890; D-668 to L-890; H-669 to L-890; V-670 to L-890;
V-671 to L-890; D-672 to L-890; S-673 to L-890; P-674 to L-890; R-675 to L-890; K-676 to L-890;
L-677 to L-890; D-678 to L-890; K-679 to L-890; C-680 to L-890; G-681 to L-890; V-682 to L-890;
C-683 to L-890; G-684 to L-890; G-685 to L-890; K-686 to L-890; G-687 to L-890; N-688 to L-890;
S-689 to L-890; C-690 to L-890; R-691 to L-890; K-692 to L-890; V-693 to L-890; S-694 to L-890;
G-695 to L-890; S-696 to L-890; L-697 to L-890; T-698 to L-890; P-699 to L-890; T-700 to L-890;
N-701 to L-890; Y-702 to L-890; G-703 to L-890; Y-704 to L-890; N-705 to L-890; D-706 to L-890;
I-707 to L-890; V-708 to L-890; T-709 to L-890; I-710 to L-890; P-711 to L-890;
A-712 to L-890; G-713 to L-890; A-714 to L-890; T-715 to L-890; N-716 to L-890; I-717 to L-890;
D-718 to L-890; V-719 to L-890; K-720 to L-890; Q-721 to L-890; R-722 to L-890; S-723 to L-890;
H-724 to L-890; P-725 to L-890; G-726 to L-890; V-727 to L-890; Q-728 to L-890; N-729 to L-890;
D-730 to L-890; G-731 to L-890; N-732 to L-890; Y-733 to L-890; L-734 to L-890; A-735 to L-890;
L-736 to L-890; K-737 to L-890; T-738 to L-890; A-739 to L-890; D-740 to L-890; G-741 to L-890;
Q-742 to L-890; Y-743 to L-890; L-744 to L-890; L-745 to L-890; N-746 to L-890; G-747 to L-890;
N-748 to L-890; L-749 to L-890; A-750 to L-890; I-751 to L-890; S-752 to L-890; A-753 to L-890;
I-754 to L-890; E-755 to L-890; Q-756 to L-890; D-757 to L-890; I-758 to L-890; L-759 to L-890;
V-760 to L-890; K-761 to L-890; G-762 to L-890; T-763 to L-890; I-764 to L-890; L-765 to L-890;
K-766 to L-890; Y-767 to L-890; S-768 to L-890; G-769 to L-890; S-770 to L-890; I-771 to L-890;
A-772 to L-890; T-773 to L-890; L-774 to L-890; E-775 to L-890; R-776 to L-890; L-777 to L-890;
Q-778 to L-890; S-779 to L-890; F-780 to L-890; R-781 to L-890; P-782 to L-890; L-783 to L-890;
P-784 to L-890; E-785 to L-890; P-786 to L-890; L-787 to L-890; T-788 to L-890; V-789 to L-890;
Q-790 to L-890; L-791 to L-890; L-792 to L-890; T-793 to L-890; V-794 to L-890; P-795 to L-890;
G-796 to L-890; E-797 to L-890; V-798 to L-890; F-799 to L-890; P-800 to L-890; P-801 to L-890;
K-802 to L-890; V-803 to L-890; K-804 to L-890; Y-805 to L-890; T-806 to L-890; F-807 to L-890;
F-808 to L-890; V-809 to L-890; P-810 to L-890; N-811 to L-890; D-812 to L-890; V-813 to L-890;
D-814 to L-890; F-815 to L-890; S-816 to L-890; M-817 to L-890; Q-818 to L-890; S-819 to L-890;
S-820 to L-890; K-821 to L-890; E-822 to L-890; R-823 to L-890; A-824 to L-890; T-825 to L-890;
T-826 to L-890; N-827 to L-890; I-828 to L-890; I-829 to L-890; Q-830 to L-890; P-831 to L-890;
L-832 to L-890; L-833 to L-890; H-834 to L-890; A-835 to L-890; Q-836 to L-890; W-837 to L-890;
V-838 to L-890; L-839 to L-890; G-840 to L-890; D-841 to L-890; W-842 to L-890; S-843 to L-890;
E-844 to L-890; C-845 to L-890; S-846 to L-890; S-847 to L-890; T-848 to L-890; C-849 to L-890;
G-850 to L-890; A-851 to L-890; G-852 to L-890; W-853 to L-890; Q-854 to L-890; R-855 to L-890;
R-856 to L-890; T-857 to L-890; V-858 to L-890; E-859 to L-890; C-860 to L-890; R-861 to L-890;
D-862 to L-890; P-863 to L-890; S-864 to L-890; G-865 to L-890; Q-866 to L-890; A-867 to L-890;
S-868 to L-890; A-869 to L-890; T-870 to L-890; C-871 to L-890; N-872 to L-890; K-873 to L-890;
A-874 to L-890; L-875 to L-890; K-876 to L-890; P-877 to L-890; E-878 to L-890; D-879 to L-890;
A-880 to L-890; K-881 to L-890; P-882 to L-890; C-883 to L-890; E-884 to L-890; S-885 to L-890;
of SEQ ID NO:4.
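
The general formula m2-890 given above lends itself to a short enumeration. The Python sketch below is illustrative only: the function name n_terminal_deletions and its min_len parameter are hypothetical, and the ten-residue peptide is a placeholder rather than SEQ ID NO:4. With min_len=2 the start position m runs from 2 to len(seq)-1, which for an 890-residue sequence corresponds to the stated range of 2 to 889; with min_len=6 the last fragment of an 890-residue sequence would begin at residue 885, as in the list above.

```python
def n_terminal_deletions(seq, min_len=2):
    """List (label, fragment) pairs for N-terminal deletions of seq.

    Each fragment spans residues m..len(seq) (1-based), for m = 2 upward,
    stopping while at least min_len residues remain.
    """
    n = len(seq)
    results = []
    for m in range(2, n - min_len + 2):
        # Label in the "X-m to Z-n" style used for the residue ranges.
        label = f"{seq[m - 1]}-{m} to {seq[-1]}-{n}"
        results.append((label, seq[m - 1:]))
    return results


toy = "ACDEFGHIKL"  # hypothetical peptide, for illustration only
for label, fragment in n_terminal_deletions(toy, min_len=6):
    print(label, fragment)
```

C-terminal deletions could be enumerated in the same way by fixing the first residue and stepping the end position down from len(seq)-1.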
Moreover, C-terminal deletions of the METH2 polypeptide can also be described by
the general formula 1-n2, where n2 is an integer from 2 to 890, where n2 corresponds to the
position of the amino acid residue identified in SEQ ID NO:4. Preferably, C-terminal deletions
of the METH2 polypeptide of the invention shown as SEQ ID NO:4 include polypeptides
comprising the amino acid sequence of residues:
M-1 to P-889; M-1 to C-888; M-1 to L-887; M-1 to Q-886; M-1 to S-885; M-1 to E-884;
M-1 to C-883; M-1 to P-882; M-1 to K-881; M-1 to A-880; M-1 to D-879; M-1 to E-878;
M-1 to P-877; M-1 to K-876; M-1 to L-875; M-1 to A-874; M-1 to K-873; M-1 to N-872;
M-1 to C-871; M-1 to T-870; M-1 to A-869; M-1 to S-868; M-1 to A-867; M-1 to Q-866;
M-1 to G-865; M-1 to S-864; M-1 to P-863; M-1 to D-862; M-1 to R-861; M-1 to C-860;
M-1 to E-859; M-1 to V-858; M-1 to T-857; M-1 to R-856; M-1 to R-855; M-1 to Q-854;
M-1 to W-853; M-1 to G-852; M-1 to A-851; M-1 to G-850; M-1 to C-849; M-1 to T-848;
M-1 to S-847; M-1 to S-846; M-1 to C-845; M-1 to E-844; M-1 to S-843; M-1 to W-842;
M-1 to D-841; M-1 to G-840; M-1 to L-839; M-1 to V-838; M-1 to W-837; M-1 to Q-836;
M-1 to A-835; M-1 to H-834; M-1 to L-833; M-1 to L-832; M-1 to P-831; M-1 to Q-830;
M-1 to I-829; M-1 to I-828; M-1 to N-827;
M-1 to T-826; M-1 to T-825; M-1 to A-824; M-1 to R-823; M-1 to E-822; M-1 to K-821;
M-1 to S-820; M-1 to S-819; M-1 to Q-818; M-1 to M-817; M-1 to S-816; M-1 to F-815;
M-1 to D-814; M-1 to V-813; M-1 to D-812; M-1 to N-811; M-1 to P-810; M-1 to V-809;
M-1 to F-808; M-1 to F-807; M-1 to T-806; M-1 to Y-805; M-1 to K-804; M-1 to V-803;
M-1 to K-802; M-1 to P-801; M-1 to P-800; M-1 to F-799; M-1 to V-798; M-1 to E-797;
M-1 to G-796; M-1 to P-795; M-1 to V-794; M-1 to T-793; M-1 to L-792; M-1 to L-791;
M-1 to Q-790; M-1 to V-789; M-1 to T-788; M-1 to L-787; M-1 to P-786; M-1 to E-785;
M-1 to P-784; M-1 to L-783; M-1 to P-782; M-1 to R-781; M-1 to F-780; M-1 to S-779;
M-1 to Q-778; M-1 to L-777; M-1 to R-776; M-1 to E-775; M-1 to L-774; M-1 to T-773;
M-1 to A-772; M-1 to I-771; M-1 to S-770; M-1 to G-769; M-1 to S-768; M-1 to Y-767;
M-1 to K-766; M-1 to L-765; M-1 to I-764; M-1 to T-763; M-1 to G-762; M-1 to K-761;
M-1 to V-760; M-1 to L-759; M-1 to I-758; M-1 to D-757; M-1 to Q-756; M-1 to E-755;
M-1 to I-754; M-1 to A-753; M-1 to S-752; M-1 to I-751; M-1 to A-750; M-1 to L-749;
M-1 to N-748; M-1 to G-747; M-1 to N-746; M-1 to L-745; M-1 to L-744; M-1 to Y-743;
M-1 to Q-742; M-1 to G-741; M-1 to D-740; M-1 to A-739; M-1 to T-738; M-1 to K-737;
M-1 to L-736; M-1 to A-735; M-1 to L-734; M-1 to Y-733; M-1 to N-732; M-1 to G-731;
M-1 to D-730; M-1 to N-729; M-1 to Q-728; M-1 to V-727; M-1 to G-726; M-1 to P-725;
M-1 to H-724; M-1 to S-723; M-1 to R-722; M-1 to Q-721; M-1 to K-720; M-1 to V-719;
M-1 to D-718; M-1 to I-717; M-1 to N-716; M-1 to T-715; M-1 to A-714; M-1 to G-713;
M-1 to A-712; M-1 to P-711; M-1 to I-710; M-1 to T-709; M-1 to V-708; M-1 to I-707;
M-1 to D-706; M-1 to N-705; M-1 to Y-704; M-1 to G-703; M-1 to Y-702; M-1 to N-701;
M-1 to T-700; M-1 to P-699; M-1 to T-698; M-1 to L-697; M-1 to S-696; M-1 to G-695;
M-1 to S-694; M-1 to V-693; M-1 to K-692; M-1 to R-691; M-1 to C-690; M-1 to S-689;
M-1 to N-688; M-1 to G-687; M-1 to K-686; M-1 to G-685; M-1 to G-684; M-1 to C-683;
M-1 to V-682; M-1 to G-681; M-1 to C-680; M-1 to K-679; M-1 to D-678; M-1 to L-677;
M-1 to K-676; M-1 to R-675; M-1 to P-674; M-1 to S-673; M-1 to D-672; M-1 to V-671;
M-1 to V-670; M-1 to H-669; M-1 to D-668; M-1 to C-667; M-1 to G-666; M-1 to A-665;
M-1 to K-664; M-1 to V-663; M-1 to C-662; M-1 to Q-661; M-1 to G-660; M-1 to R-659;
M-1 to V-658; M-1
to C-657; M-1 to I-656; M-1 to A-655; M-1 to L-654; M-1 to T-653; M-1 to E-652;
M-1 to P-651; M-1 to G-650; M-1 to C-649; M-1 to L-648; M-1 to T-647; M-1 to G-646;
M-1 to D-645; M-1 to I-644; M-1 to V-643; M-1 to K-642; M-1 to A-641; M-1 to E-640;
M-1 to F-639; M-1 to V-638; M-1 to K-637; M-1 to F-636; M-1 to E-635; M-1 to S-634;
M-1 to R-633; M-1 to G-632; M-1 to R-631; M-1 to A-630; M-1 to R-629; M-1 to C-628;
M-1 to F-627; M-1 to L-626; M-1 to K-625; M-1 to C-624; M-1 to R-623; M-1 to D-622;
M-1 to R-621; M-1 to P-620; M-1 to S-619; M-1 to V-618; M-1 to G-617; M-1 to A-616;
M-1 to Y-615; M-1 to K-614; M-1 to P-613; M-1 to V-612; M-1 to W-611; M-1 to Q-610;
M-1 to L-609; M-1 to L-608; M-1 to N-607; M-1 to G-606; M-1 to D-605; M-1 to M-604;
M-1 to D-603; M-1 to T-602; M-1 to Y-601; M-1 to N-600; M-1 to Y-599; M-1 to A-598;
M-1 to N-597; M-1 to Y-596; M-1 to K-595; M-1 to E-594; M-1 to C-593; M-1 to Q-592;
M-1 to Q-591; M-1 to E-590; M-1 to R-589; M-1 to F-588; M-1 to S-587; M-1 to K-586;
M-1 to G-585; M-1 to D-584; M-1 to P-583; M-1 to P-582; M-1 to C-581; M-1 to E-580;
M-1 to E-579; M-1 to T-578; M-1 to H-577; M-1 to C-576; M-1 to S-575; M-1 to Q-574;
M-1 to Y-573; M-1 to K-572; M-1 to A-571; M-1 to R-570; M-1 to R-569; M-1 to G-568;
M-1 to L-567; M-1 to C-566; M-1 to Y-565; M-1 to R-564; M-1 to G-563; M-1 to G-562;
M-1 to N-561; M-1 to Q-560; M-1 to P-559; M-1 to E-558; M-1 to P-557; M-1 to D-556;
M-1 to K-555; M-1 to C-554; M-1 to E-553; M-1 to R-552; M-1 to H-551; M-1 to S-550;
M-1 to F-549; M-1 to Q-548; M-1 to V-547; M-1 to G-546; M-1 to G-545; M-1 to G-544;
M-1 to C-543; M-1 to T-542; M-1 to R-541; M-1 to S-540; M-1 to C-539; M-1 to E-538;
M-1 to G-537; M-1 to W-536; M-1 to P-535; M-1 to G-534; M-1 to W-533; M-1 to P-532;
M-1 to A-531; M-1 to W-530; M-1 to G-529; M-1 to G-528; M-1 to D-527; M-1 to V-526; M-1 to
V-525; M-1 to P-524; M-1 to K-523; M-1 to P-522; M-1 to R-521; M-1 to E-520; M-1 to 
V-519; M-1 to E-518; M-1 to E-517; M-1 to E-516; M-1 to P-515; M-1 to L-514; M-1 to 
C-513; M-1 to S-512; M-1 to G-511; M-1 to E-510; M-1 to S-509; M-1 to C-508; M-1 to 
L-507; M-1 to H-506; M-1 to G-505; M-1 to P-504; M-1 to G-503; M-1 to C-502; M-1 to 
P-501; M-1 to T-500; M-1 to G-499; M-1 to D-498; M-1 to A-497; M-1 to W-496; M-1 to 
P-495; M-1 to L-494; M-1 to S-493; M-1 to G-492; M-1 to N-491; M-1 to K-490; M-1 to 
T-489; M-1 to H-488; M-1 to C-487; M-1 to L-486; M-1 to P-485; M-1 to E-484; M-1 to 
A-483; M-1 to G-482; M-1 to D-481; M-1 to T-480; M-1 to H-479; M-1 to C-478; M-1 to 
W-477; M-1 to L-476; M-1 to Q-475; M-1 to A-474; M-1 to C-473; M-1 to V-472; M-1 to 
D-471; M-1 to Q-470; M-1 to A-469; M-1 to S-468; M-1 to T-467; M-1 to N-466; M-1 to 
P-465; M-1 to C-464; M-1 to H-463; M-1 to R-462; M-1 to F-461; M-1 to D-460; M-1 to 
P-459; M-1 to G-458; M-1 to F-457; M-1 to I-456; M-1 to Q-455; M-1 to R-454; M-1 to C-
453; M-1 to Q-452; M-1 to Q-451; M-1 to D-450; M-1 to L-449; M-1 to Q-448; M-1 to Y- 
447; M-1 to L-446; M-1 to A-445; M-1 to M-444; M-1 to R-443; M-1 to G-442; M-1 to P- 
441; M-1 to L-440; M-1 to G-439; M-1 to T-438; M-1 to P-437; M-1 to L-436; M-1 to P-
435; M-1 to L-434; M-1 to A-433; M-1 to A-432; M-1 to G-431; M-1 to P-430; M-1 to A-
429; M-1 to D-428; M-1 to L-427; M-1 to L-426; M-1 to C-425; M-1 to D-424; M-1 to G- 
423; M-1 to H-422; M-1 to G-421; M-1 to G-420; M-1 to D-419; M-1 to L-418; M-1 to L- 
417; M-1 to E-416; M-1 to T-415; M-1 to L-414; M-1 to Y-413; M-1 to M-412; M-1 to A- 
411; M-1 to S-410; M-1 to C-409; M-1 to P-408; M-1 to S-407; M-1 to W-406; M-1 to P- 
405; M-1 to L-404; M-1 to T-403; M-1 to Q-402; M-1 to N-401; M-1 to L-400; M-1 to H- 
399; M-1 to V-398; M-1 to F-397; M-1 to L-396; M-1 to P-395; M-1 to A-394; M-1 to M- 
393; M-1 to V-392; M-1 to H-391; M-1 to H-390; M-1 to K-389; M-1 to G-388; M-1 to M- 
387; M-1 to P-386; M-1 to G-385; M-1 to F-384; M-1 to L-383; M-1 to R-382; M-1 to T- 
381; M-1 to C-380; M-1 to P-379; M-1 to K-378; M-1 to S-377; M-1 to D-376; M-1 to D- 
375; M-1 to H-374; M-1 to P-373; M-1 to M-372; M-1 to S-371; M-1 to L-370; M-1 to V- 
369; M-1 to H-368; M-1 to G-367; M-1 to L-366; M-1 to E-365; M-1 to H-364; M-1 to A- 
363; M-1 to L-362; M-1 to T-361; M-1 to H-360; M-1 to A-359; M-1 to A-358; M-1 to Q- 
357; M-1 to L-356; M-1 to G-355; M-1 to E-354; M-1 to D-353; M-1 to E-352; M-1 to I- 
351; M-1 to V-350; M-1 to S-349; M-1 to C-348; M-1 to S-347; M-1 to K-346; M-1 to N- 
345; M-1 to P-344; M-1 to D-343; M-1 to C-342; M-1 to I-341; M-1 to T-340; M-1 to G-
339; M-1 to I-338; M-1 to D-337; M-1 to A-336; M-1 to V-335; M-1 to G-334; M-1 to L-
333; M-1 to T-332; M-1 to D-331; M-1 to C-330; M-1 to L-329; M-1 to G-328; M-1 to E- 
327; M-1 to Q-326; M-1 to G-325; M-1 to C-324; M-1 to F-323; M-1 to N-322; M-1 to Q- 
321; M-1 to R-320; M-1 to T-319; M-1 to L-318; M-1 to L-317; M-1 to 1-316; M-1 to A- 
315; M-1 to T-314; M-1 to D-313; M-1 to Y-312; M-1 to H-31 1; M-1 to E-310; M-1 to P- 
309; M-1 to H-308; M-1 to R-307; M-1 to D-306; M-1 to S-305; M-1 to P-304; M-1 to Q- 
303; M-1 to N-302; M-1 to F-301; M-1 to R-300; M-1 to R-299; M-1 to Q-298; M-1 to W- 
297; M-1 to N-296; M-1 to C-295; M-1 to F-294; M-1 to N-293; M-1 to R-292; M-1 to L- 
291; M-1 to T-290; M-1 to L-289; M-1 to G-288; M-1 to G-287; M-1 to N-286; M-1 to D- 
285; M-1 to S-284; M-1 to V-283; M-1 to E-282; M-1 to P-281; M-1 to G-280; M-1 to W- 
279; M-1 to K-278; M-1 to E-277; M-1 to D-276; M-1 to E-275; M-1 to V-274; M-1 to I- 
273; M-1 to L-272; M-1 to V-271; M-1 to K-270; M-1 to V-269; M-1 to V-268; M-1 to M- 
267; M-1 to L-266; M-1 to N-265; M-1 to 1-264; M-1 to S-263; M-1 to N-262; M-1 to K- 
261; M-1 to 1-260; M-1 to S-259; M-1 to P-258; M-1 to H-257; M-1 to K-256; M-1 to Y- 
255; M-1 to 1-254; M-1 to R-253; M-1 to A-252; M-1 to A-251; M-1 to V-250; M-1 to S- 
249; M-1 to M-248; M-1 to L-247; M-1 to T-246; M-1 to L-245; M-1 to 1-244; M-1 to H- 
243; M-1 to N-242; M-1 to Q-241; M-1 to L-240; M-1 to D-239; M-1 to A-238; M-1 to G- 
237; M-1 to Y-236; M-1 to F-235; M-1 to A-234; M-1 to A-233; M-1 to M-232; M-1 to S- 
231; M-1 to A-230; M-1 to D-229; M-1 to A-228; M-1 to V-227; M-1 to L-226; M-1 to L- 
225; M-1 to T-224; M-1 to E-223; M-1 to V-222; M-1 to F-221; M-1 to R-220; M-1 to A- 
219; M-1 to E-218; M-1 to S-217; M-1 to V-216; M-1 to F-215; M-1 to R-214; M-1 to K- 
213; M-1 to T-212; M-1 to R-21 1; M-1 to S-210; M-1 to T-209; M-1 to A-208; M-1 to G- 
207; M-1 to L-206; M-1 to P-205; M-1 to P-204; M-1 to P-203; M-1 to P-202; M-1 to E- 
201; M-1 to S-200; M-1 to A-199; M-1 to G-198; M-1 to E-197; M-1 to A-196; M-1 to E- 
195; M-1 to E-194; M-1 to E-193; M-1 to Q-192; M-1 to S-191; M-1 to E-190; M-1 to E- 
189; M-1 to E-188; M-1 to S-187; M-1 to D-186; M-1 to E-185; M-1 to Q-184; M-1 to H- 
183; M-1 to D-182; M-1 to G-181; M-1 to R-180; M-1 to E-179; M-1 to Q-178; M-1 to R- 
177; M-1 to Q-176; M-1 to G-175; M-1 to E-174; M-1 to G-173; M-1 to T-172; M-1 to E- 
171; M-1 to V-170; M-1 to E-169; M-1 to W-168; M-1 to E-167; M-1 to P-166; M-1 to G- 
165; M-1 to R-164; M-1 to P-163; M-1 to L-162; M-1 to P-161; M-1 to R-160; M-1 to A- 
159; M-1 to G-158; M-1 to A-157; M-1 to P-156; M-1 to G-155; M-1 to W-154; M-1 to R- 
153; M-1 to Q-152; M-1 to L-151; M-1 to R-150; M-1 to H-149; M-1 to P-148; M-1 to Q- 
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147; M-1 to A-146; M-1 to L-145; M-1 to S-144; M-1 to G-143; M-1 to G-142; M-1 to A- 
141; M-1 to G-140; M-1 to Q-139; M-1 to P-138; M-1 to Q-137; M-1 to 1-136; M-1 to T- 
135; M-1 to F-134; M-1 to E-133; M-1 to E-132; M-1 to G-131; M-1 to D-130; M-1 to L- 
129; M-1 to L-128; M-1 to F-127; M-1 to S-126; M-1 to G-125; M-1 to S-124; M-1 to L- 
5 123; M-1 to G-122; M-1 to R-121; M-1 to C-120; M-1 to L-1 19; M-1 to S-118; M-1 to V- 

117; M-1 to A-116;M-1 to A-115; M-1 to L-1 14; M-1 to S-113; M-1 toE-112; M-1 toP- 
111; M-1 to E-110; M-1 to G-109;M-1 toN-108;M-l to V-107; M-1 toT-106; M-1 toG- 
105; M-1 to S-104; M-1 to F-103; M-1 toF-102; M-1 to C-101; M-1 to G-lOO; M-1 to R-99; 
M-1 to L-98; M-1 to G-97; M-1 to R-96; M-1 to E-95; M-1 to G-94; M-1 to G-93; M-1 to T-92;
M-1 to A-91; M-1 to R-90; M-1 to G-89; M-1 to S-88; M-1 to G-87; M-1 to G-86; M-1 to L-85;
M-1 to R-84; M-1 to E-83; M-1 to I-82; M-1 to K-81; M-1 to F-80; M-1 to E-79; M-1 to P-78;
M-1 to A-77; M-1 to L-76; M-1 to F-75; M-1 to S-74; M-1 to D-73; M-1 to D-72; M-1 to P-71;
M-1 to A-70; M-1 to L-69; M-1 to R-68; M-1 to L-67; M-1 to V-66; M-1 to F-65; M-1 to G-64;
M-1 to K-63; M-1 to G-62; M-1 to F-61; M-1 to A-60; M-1 to S-59; M-1 to L-58; M-1 to H-57;
M-1 to L-56; M-1 to A-55; M-1 to L-54; M-1 to E-53; M-1 to G-52; M-1 to A-51; M-1 to S-50;
M-1 to G-49; M-1 to P-48; M-1 to L-47; M-1 to R-46; M-1 to T-45; M-1 to P-44; M-1 to V-43;
M-1 to V-42; M-1 to L-41; M-1 to E-40; M-1 to S-39; M-1 to A-38; M-1 to Q-37; M-1 to G-36;
M-1 to G-35; M-1 to A-34; M-1 to A-33; M-1 to P-32; M-1 to R-31; M-1 to A-30; M-1 to P-29;
M-1 to A-28; M-1 to G-27; M-1 to R-26; M-1 to A-25; M-1 to L-24; M-1 to P-23; M-1 to L-22;
M-1 to L-21; M-1 to L-20; M-1 to L-19; M-1 to L-18; M-1 to L-17; M-1 to L-16; M-1 to L-15;
M-1 to L-14; M-1 to F-13; M-1 to P-12; M-1 to L-11; M-1 to W-10; M-1 to R-9; M-1 to P-8;
M-1 to A-7 of SEQ ID NO:4. Preferably, any of the above-listed N- or C-terminal deletions
can be combined to produce an N- and C-terminal deleted METH2 polypeptide.
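
The "M-1 to X-n" notation used in the list above can be illustrated with a short Python sketch. It is only an illustration: the function name and the 10-residue toy sequence are hypothetical and are not taken from SEQ ID NO:4, and the sketch assumes that each listed fragment starts at residue 1 and ends at a successively earlier residue.

```python
def c_terminal_deletion_labels(sequence, min_end):
    """List C-terminal deletion fragments in 'M-1 to X-n' notation.

    sequence : one-letter amino acid string (position 1 is sequence[0])
    min_end  : smallest C-terminal end position to include (1-based)
    """
    start_label = f"{sequence[0]}-1"
    labels = []
    # Walk from the fragment lacking only the last residue down to min_end.
    for end in range(len(sequence) - 1, min_end - 1, -1):
        labels.append(f"{start_label} to {sequence[end - 1]}-{end}")
    return labels


# Hypothetical toy sequence, used only to show the label format.
toy_sequence = "MAPRWLPFLL"
for label in c_terminal_deletion_labels(toy_sequence, min_end=7):
    print(label)
```

Under the same assumption, the full list for a real sequence would be obtained by passing that sequence and the smallest end position recited in the text.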

The invention also provides polypeptides having one or more amino acids deleted
from both the amino and the carboxyl termini, which may be described generally as having
residues m1-n1 of SEQ ID NO:2 or m2-n2 of SEQ ID NO:4, where n and m are integers as
described above.

The invention also provides mutants of the metalloprotease domain of METH1,
which are described by the general formula m3-n3, where m3 is an integer from 205 to 265,
and n3 is an integer from 285 to 950, where m3 and n3 correspond to the position of the
amino acid residue identified in SEQ ID NO:2. The invention further provides mutants of
the metalloprotease domain of METH1, which are described by the general formula m4-n4,
where m4 is an integer from 1 to 409, and n4 is an integer from 429 to 489, where m4 and n4
correspond to the position of the amino acid residue identified in SEQ ID NO:2.

The invention also provides mutants of the disintegrin domain of METH1, which are
described by the general formula m5-n5, where m5 is an integer from 430 to 490, and n5 is
an integer from 510 to 950, where m5 and n5 correspond to the position of the amino acid
residue identified in SEQ ID NO:2. The invention further provides mutants of the
disintegrin domain of METH1, which are described by the general formula m6-n6, where
m6 is an integer from 1 to 494, and n6 is an integer from 514 to 574, where m6 and n6
correspond to the position of the amino acid residue identified in SEQ ID NO:2.

The invention further provides mutants of the TSP1 domain of METH1, which are
described by the general formula m7-n7, where m7 is an integer from 515 to 575, and n7 is
an integer from 595 to 950, where m7 and n7 correspond to the position of the amino acid
residue identified in SEQ ID NO:2. The invention also provides mutants of the TSP1
domain of METH1, which are described by the general formula m8-n8, where m8 is an
integer from 1 to 548, and n8 is an integer from 568 to 628, where m8 and n8 correspond to
the position of the amino acid residue identified in SEQ ID NO:2.

The invention further provides mutants of the TSP2 domain of METH1, which are
described by the general formula m9-n9, where m9 is an integer from 801 to 871, and n9 is
an integer from 891 to 950, where m9 and n9 correspond to the position of the amino acid
residue identified in SEQ ID NO:2. The invention also provides mutants of the TSP2
domain of METH1, which are described by the general formula m10-n10, where m10 is an
integer from 1 to 834, and n10 is an integer from 864 to 924, where m10 and n10 correspond
to the position of the amino acid residue identified in SEQ ID NO:2.

The invention further provides mutants of the TSP3 domain of METH1, which are
described by the general formula m11-n11, where m11 is an integer from 865 to 925, and n11
is an integer from 945 to 950, where m11 and n11 correspond to the position of the amino acid
residue identified in SEQ ID NO:2. The invention also provides mutants of the TSP3
domain of METH1, which are described by the general formula m12-n12, where m12 is an
integer from 1 to 884, and n12 is an integer from 904 to 950, where m12 and n12 correspond
to the position of the amino acid residue identified in SEQ ID NO:2.

The invention further provides mutants of the metalloprotease domain of METH2,
which are described by the general formula m13-n13, where m13 is an integer from 184 to 244,
and n13 is an integer from 264 to 890, where m13 and n13 correspond to the position of the
amino acid residue identified in SEQ ID NO:4. The invention also provides mutants of the
metalloprotease domain of METH2, which are described by the general formula m14-n14,
where m14 is an integer from 1 to 389, and n14 is an integer from 409 to 469, where m14 and
n14 correspond to the position of the amino acid residue identified in SEQ ID NO:4.

The invention further provides mutants of the disintegrin domain of METH2, which
are described by the general formula m15-n15, where m15 is an integer from 400 to 470, and
n15 is an integer from 490 to 890, where m15 and n15 correspond to the position of the amino
acid residue identified in SEQ ID NO:4. The invention also provides mutants of the
disintegrin domain of METH2, which are described by the general formula m16-n16, where
m16 is an integer from 1 to 479, and n16 is an integer from 499 to 559, where m16 and n16
correspond to the position of the amino acid residue identified in SEQ ID NO:4.

The invention further provides mutants of the TSP1 domain of METH2, which are
described by the general formula m17-n17, where m17 is an integer from 500 to 560, and n17
is an integer from 580 to 890, where m17 and n17 correspond to the position of the amino acid
residue identified in SEQ ID NO:4. The invention also provides mutants of the TSP1
domain of METH2, which are described by the general formula m18-n18, where m18 is an
integer from 1 to 533, and n18 is an integer from 553 to 613, where m18 and n18 correspond
to the position of the amino acid residue identified in SEQ ID NO:4.

The invention further provides mutants of the TSP2 domain of METH2, which are
described by the general formula m19-n19, where m19 is an integer from 807 to 867, and n19
is an integer from 887 to 890, where m19 and n19 correspond to the position of the amino acid
residue identified in SEQ ID NO:4. The invention also provides mutants of the TSP2
domain of METH2, which are described by the general formula m20-n20, where m20 is an
integer from 1 to 840, and n20 is an integer from 860 to 890, where m20 and n20 correspond
to the position of the amino acid residue identified in SEQ ID NO:4.
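
As a minimal sketch of how the general formulas above can be read, the following Python fragment checks whether a pair of start and end positions falls within the stated integer ranges and extracts the corresponding residues. The dictionary keys, function names and the toy sequence are hypothetical; only three of the formulas are transcribed, using the ranges recited above for the first METH1 metalloprotease formula, the second METH1 metalloprotease formula and the first METH2 metalloprotease formula.

```python
# (start range, end range) pairs, 1-based and inclusive, copied from the text.
DOMAIN_FORMULAS = {
    "METH1 metalloprotease, first formula": ((205, 265), (285, 950)),
    "METH1 metalloprotease, second formula": ((1, 409), (429, 489)),
    "METH2 metalloprotease, first formula": ((184, 244), (264, 890)),
}


def is_within_formula(name, m, n):
    """Return True if start m and end n both lie in the recited ranges."""
    (m_low, m_high), (n_low, n_high) = DOMAIN_FORMULAS[name]
    return m_low <= m <= m_high and n_low <= n <= n_high


def residues(sequence, m, n):
    """Return residues m through n of a sequence (1-based, inclusive)."""
    return sequence[m - 1:n]


print(is_within_formula("METH1 metalloprotease, first formula", 205, 285))
print(residues("ABCDEFGH", 3, 5))  # toy sequence, not a METH sequence
```

The remaining formulas could be added to the dictionary in the same way.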

Also preferred are METH1 or METH2 polypeptide and polynucleotide fragments
characterized by structural or functional domains. Preferred embodiments of the invention
include fragments that comprise alpha-helix and alpha-helix forming regions
("alpha-regions"), beta-sheet and beta-sheet-forming regions ("beta-regions"), turn and
turn-forming regions ("turn-regions"), coil and coil-forming regions ("coil-regions"),
hydrophilic regions, hydrophobic regions, alpha amphipathic regions, beta amphipathic
regions, flexible regions, surface-forming regions, substrate binding region, and high
antigenic index regions. As set out in the Figures, such preferred regions include
Garnier-Robson alpha-regions, beta-regions, turn-regions, and coil-regions, Chou-Fasman
alpha-regions, beta-regions, and turn-regions, Kyte-Doolittle hydrophilic regions and
hydrophobic regions, Eisenberg alpha and beta amphipathic regions, Karplus-Schulz
flexible regions, Emini surface-forming regions, and Jameson-Wolf high antigenic index
regions. Polypeptide fragments of SEQ ID NO:2 falling within conserved domains are
specifically contemplated by the present invention. (See Figures 10 & 11 and Tables 1 & 2.)
Moreover, polynucleotide fragments encoding these domains are also contemplated.

Other preferred fragments are biologically active METH1 or METH2 fragments.
Biologically active fragments are those exhibiting activity similar, but not necessarily
identical, to an activity of the METH1 or METH2 polypeptide. The biological activity of
the fragments may include an improved desired activity, or a decreased undesirable activity.

However, many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are related
to SEQ ID NO:1 or SEQ ID NO:3 and may have been publicly available prior to conception
of the present invention. Preferably, such related polynucleotides are specifically excluded
from the scope of the present invention. To list every related sequence would be
cumbersome. Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of a-b,
where a is any integer from 1 to 936 of SEQ ID NO:1, b is an integer from 15 to 950,
where both a and b correspond to the positions of nucleotide residues shown in SEQ ID
NO:1, and where b is greater than or equal to a + 14. Moreover, preferably excluded
from the present invention are one or more polynucleotides comprising a nucleotide
sequence described by the general formula of a-b, where a is any integer from 1 to 876
of SEQ ID NO:3, b is an integer from 15 to 890, where both a and b correspond to the positions
of nucleotide residues shown in SEQ ID NO:3, and where b is greater than or equal to
a + 14.
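
The a-b formula above defines a finite set of position pairs, and a short Python sketch can make its size concrete. The function names are hypothetical and the sketch only applies the three recited conditions: a from 1 to the stated maximum, b from 15 to the stated maximum, and b greater than or equal to a + 14, so that each range spans at least 15 positions.

```python
def is_recited_range(a, b, max_a, max_b):
    """Return True if (a, b) satisfies the three conditions of the a-b formula."""
    return 1 <= a <= max_a and 15 <= b <= max_b and b >= a + 14


def count_recited_ranges(max_a, max_b):
    """Count all integer pairs (a, b) that satisfy the a-b formula."""
    total = 0
    for a in range(1, max_a + 1):
        lowest_b = max(15, a + 14)  # b must be at least 15 and at least a + 14
        if lowest_b <= max_b:
            total += max_b - lowest_b + 1
    return total


print(count_recited_ranges(936, 950))  # limits recited for SEQ ID NO:1
print(count_recited_ranges(876, 890))  # limits recited for SEQ ID NO:3
```

Because the number of pairs runs to several hundred thousand for each sequence, the formula is a compact alternative to listing every range individually.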

The above-described fragments may be used to make fusion proteins, for example
Fc or Flag fusion proteins, as described below.

Epitopes & Antibodies

In another aspect, the invention provides peptides and polypeptides comprising
epitope-bearing portions of the polypeptides of the present invention. These epitopes are
immunogenic or antigenic epitopes of the polypeptides of the present invention. An
"immunogenic epitope" is defined as a part of a protein that elicits an antibody response in
vivo when the whole polypeptide of the present invention, or fragment thereof, is the
immunogen. On the other hand, a region of a polypeptide to which an antibody can bind is
defined as an "antigenic determinant" or "antigenic epitope." The number of in vivo
immunogenic epitopes of a protein generally is less than the number of antigenic epitopes.
See, e.g., Geysen et al. (1983) Proc. Natl. Acad. Sci. USA 81:3998-4002. However,
antibodies can be made to any antigenic epitope, regardless of whether it is an immunogenic
epitope, by using methods such as phage display. See, e.g., Petersen G. et al. (1995) Mol.
Gen. Genet. 249:425-431. Therefore, included in the present invention are both
immunogenic epitopes and antigenic epitopes.

A list of exemplified amino acid sequences comprising immunogenic epitopes is
shown in Tables 1 and 2. It is pointed out that Tables 1 and 2 only list amino acid residues
comprising epitopes predicted to have the highest degree of antigenicity using the algorithm
of Jameson and Wolf, (1988) Comp. Appl. Biosci. 4:181-186 (said references incorporated
by reference in their entireties). The Jameson-Wolf antigenic analysis was performed using
the computer program PROTEAN, using default parameters (Version 3.11 for the Power
Macintosh, DNASTAR, Inc., 1228 South Park Street, Madison, WI). Portions of
polypeptides not listed in Tables 1 and 2 are not considered non-immunogenic.
The immunogenic epitopes of Tables 1 and 2 are exemplified lists, not exhaustive lists,
because other immunogenic epitopes are merely not recognized as such by the particular
algorithm used. Amino acid residues comprising other immunogenic epitopes may be
routinely determined using algorithms similar to the Jameson-Wolf analysis or by in vivo
testing for an antigenic response using methods known in the art. See, e.g., Geysen et al.,
supra; U.S. Patents 4,708,781; 5,194,392; 4,433,092; and 5,480,971 (said references
incorporated by reference in their entireties).

Antigenic epitope-bearing peptides and polypeptides of the invention preferably
contain a sequence of at least seven, more preferably at least nine and most preferably
between about 15 to about 30 amino acids contained within the amino acid sequence of a
polypeptide of the invention.

Using DNAstar analysis, SEQ ID NO:2 was found antigenic at amino acids: 2-14,
32-44, 47-60, 66-78, 87-103, 109-118, 146-162, 168-180, 183-219, 223-243, 275-284,
296-306, 314-334, 341-354, 357-376, 392-399, 401-410, 418-429, 438-454, 456-471, 474-488,
510-522, 524-538, 550-561, 565-626, 630-643, 659-671, 679-721, 734-749, 784-804,
813-820, 825-832, 845-854, 860-894, 899-917, 919-924 and 928-939.

Using DNAstar analysis, SEQ ID NO:4 was found antigenic at amino acids: 26-38,
45-52, 69-76, 80-99, 105-113, 129-136, 138-217, 254-263, 273-289, 294-313, 321-331,
339-356, 371-383, 417-427, 438-443, 459-471, 479-505, 507-526, 535-546, 550-607, 615-640,
648-653, 660-667, 669-681, 683-704, 717-732, 737-743, 775-787, 797-804, 811-825,
840-867 and 870-884.

Thus, these regions of METH1 and/or METH2 are non-limiting examples of
antigenic polypeptides or peptides that can be used to raise METH1- and/or METH2-specific
antibodies.
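
The antigenic regions listed above for SEQ ID NO:2 lend themselves to a simple lookup. The Python sketch below, offered only as an illustration, reports which listed regions a candidate peptide overlaps; the function name and the example peptide positions 100-120 are hypothetical, while the region boundaries are copied from the list above.

```python
# Antigenic regions of SEQ ID NO:2 as listed in the text (1-based, inclusive).
METH1_ANTIGENIC_REGIONS = [
    (2, 14), (32, 44), (47, 60), (66, 78), (87, 103), (109, 118),
    (146, 162), (168, 180), (183, 219), (223, 243), (275, 284),
    (296, 306), (314, 334), (341, 354), (357, 376), (392, 399),
    (401, 410), (418, 429), (438, 454), (456, 471), (474, 488),
    (510, 522), (524, 538), (550, 561), (565, 626), (630, 643),
    (659, 671), (679, 721), (734, 749), (784, 804), (813, 820),
    (825, 832), (845, 854), (860, 894), (899, 917), (919, 924),
    (928, 939),
]


def overlapping_regions(start, end, regions):
    """Return the regions that share at least one residue with start..end."""
    return [(s, e) for (s, e) in regions if s <= end and start <= e]


# Hypothetical candidate peptide spanning residues 100-120.
print(overlapping_regions(100, 120, METH1_ANTIGENIC_REGIONS))
```

The same function could be applied to the SEQ ID NO:4 list by supplying those region boundaries instead.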

It is particularly pointed out that the amino acid sequences of Tables 1 and 2 
comprise immunogenic epitopes. Tables 1 and 2 list only the critical residues of 
immunogenic epitopes determined by the Jameson-Wolf analysis. Thus, additional flanking
residues on either the N-terminal, C-terminal, or both N- and C-terminal ends may be added 
to the sequences of Tables 1 and 2 to generate an epitope-bearing polypeptide of the present 
invention. Therefore, the immunogenic epitopes of Tables 1 and 2 may include additional 
N-terminal or C-terminal amino acid residues. The additional flanking amino acid residues
may be contiguous flanking N-terminal and/or C-terminal sequences from the polypeptides 
of the present invention, heterologous polypeptide sequences, or may include both 
contiguous flanking sequences from the polypeptides of the present invention and 
heterologous polypeptide sequences. 

Polypeptides of the present invention comprising immunogenic or antigenic epitopes
are at least 7 amino acid residues in length. "At least" means that a polypeptide of the
present invention comprising an immunogenic or antigenic epitope may be 7 amino acid
residues in length or any integer between 7 amino acids and the number of amino acid
residues of the full length polypeptides of the invention. Preferred polypeptides comprising
immunogenic or antigenic epitopes are at least 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65,
70, 75, 80, 85, 90, 95, or 100 amino acid residues in length. However, it is pointed out that
each and every integer between 7 and the number of amino acid residues of the full length
polypeptide is included in the present invention.

The immunogenic and antigenic epitope-bearing fragments may be specified by either the
number of contiguous amino acid residues, as described above, or further specified by
N-terminal and C-terminal positions of these fragments on the amino acid sequence of SEQ
ID NO:2 or 4. Every combination of an N-terminal and C-terminal position that a fragment
of, for example, at least 7 or at least 15 contiguous amino acid residues in length could
occupy on the amino acid sequence of SEQ ID NO:2 or 4 is included in the invention.
Again, "at least 7 contiguous amino acid residues in length" means 7 amino acid residues
in length or any integer between 7 amino acids and the number of amino acid residues of the
full length polypeptide of the present invention. Specifically, each and every integer between
7 and the number of amino acid residues of the full length polypeptide is included in the
present invention.

Immunogenic and antigenic epitope-bearing polypeptides of the invention are useful, 
for example, to make antibodies which specifically bind the polypeptides of the invention, 
and in immunoassays to detect the polypeptides of the present invention. The antibodies are 
useful, for example, in affinity purification of the polypeptides of the present invention. The 
antibodies may also routinely be used in a variety of qualitative or quantitative
immunoassays, specifically for the polypeptides of the present invention using methods
known in the art. See, e.g., Harlow et al. Antibodies: A Laboratory Manual, (Cold Spring 
Harbor Laboratory Press; 2nd Ed. 1988). 

The epitope-bearing polypeptides of the present invention may be produced by any
conventional means for making polypeptides including synthetic and recombinant methods
known in the art. For instance, epitope-bearing peptides may be synthesized using known
methods of chemical synthesis. For instance, Houghten has described a simple method for
the synthesis of large numbers of peptides, such as 10-20 mg of 248 individual and distinct
13 residue peptides representing single amino acid variants of a segment of the HA1
polypeptide, all of which were prepared and characterized (by ELISA-type binding studies)
in less than four weeks (Houghten, R. A. Proc. Natl. Acad. Sci. USA 82:5131-5135 (1985)).
This "Simultaneous Multiple Peptide Synthesis (SMPS)" process is further described in U.S.
Patent No. 4,631,211 to Houghten and coworkers (1986). In this procedure the individual
resins for the solid-phase synthesis of various peptides are contained in separate
solvent-permeable packets, enabling the optimal use of the many identical repetitive steps
involved in solid-phase methods. A completely manual procedure allows 500-1000 or more
syntheses to be conducted simultaneously (Houghten et al. (1985) Proc. Natl. Acad. Sci.
USA 82:5131-5135 at 5134).

Epitope-bearing polypeptides of the present invention are used to induce antibodies
according to methods well known in the art including, but not limited to, in vivo
immunization, in vitro immunization, and phage display methods. See, e.g., Sutcliffe et al.,
supra; Wilson et al., supra; and Bittle et al. (1985) J. Gen. Virol. 66:2341-2354. If in vivo
immunization is used, animals may be immunized with free peptide; however, anti-peptide
antibody titer may be boosted by coupling of the peptide to a macromolecular carrier, such
as keyhole limpet hemocyanin (KLH) or tetanus toxoid. For instance, peptides containing
cysteine residues may be coupled to a carrier using a linker such as
m-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS), while other peptides may be coupled
to carriers using a more general linking agent such as glutaraldehyde. Animals such as
rabbits, rats and mice are immunized with either free or carrier-coupled peptides, for
instance, by intraperitoneal and/or intradermal injection of emulsions containing about
100 µg of peptide or carrier protein and Freund's adjuvant. Several booster injections may
be needed, for instance, at intervals of about two weeks, to provide a useful titer of
anti-peptide antibody which can be detected, for example, by ELISA assay using free
peptide adsorbed to a solid surface. The titer of anti-peptide antibodies in serum from an
immunized animal may be increased by selection of anti-peptide antibodies, for instance,
by adsorption to the peptide on a solid support and elution of the selected antibodies
according to methods well known in the art.

As one of skill in the art will appreciate, and as discussed above, the polypeptides of
the present invention comprising an immunogenic or antigenic epitope can be fused to
heterologous polypeptide sequences. For example, the polypeptides of the present invention
may be fused with the constant domain of immunoglobulins (IgA, IgE, IgG, IgM), or
portions thereof (CH1, CH2, CH3, any combination thereof including both entire domains
and portions thereof) resulting in chimeric polypeptides. These fusion proteins facilitate
purification, and show an increased half-life in vivo. This has been shown, e.g., for chimeric
proteins consisting of the first two domains of the human CD4-polypeptide and various
domains of the constant regions of the heavy or light chains of mammalian
immunoglobulins. See, e.g., EPA 0,394,827; Traunecker et al. (1988) Nature 331:84-86.
Fusion proteins that have a disulfide-linked dimeric structure due to the IgG portion can also
be more efficient in binding and neutralizing other molecules than monomeric polypeptides
or fragments thereof alone. See, e.g., Fountoulakis et al. (1995) J. Biochem. 270:3958-3964.
Nucleic acids encoding the above epitopes can also be recombined with a gene of interest
as an epitope tag to aid in detection and purification of the expressed polypeptide.

The present invention further relates to antibodies and T-cell antigen receptors (TCR)
which specifically bind the polypeptides of the present invention. The antibodies of the
present invention include IgG (including IgG1, IgG2, IgG3, and IgG4), IgA (including IgA1
and IgA2), IgD, IgE, or IgM, and IgY. As used herein, the term "antibody" (Ab) is meant
to include whole antibodies, including single-chain whole antibodies, and antigen-binding
fragments thereof. Most preferably the antibodies are human antigen binding antibody
fragments of the present invention including, but not limited to, Fab, Fab' and F(ab')2, Fd,
single-chain Fvs (scFv), single-chain antibodies, disulfide-linked Fvs (sdFv) and fragments
comprising either a VL or VH domain. The antibodies may be from any animal origin
including birds and mammals. Preferably, the antibodies are human, murine, rabbit, goat,
guinea pig, camel, horse, or chicken.

Antigen-binding antibody fragments, including single-chain antibodies, may
comprise the variable region(s) alone or in combination with the entirety or a portion of the
following: hinge region, CH1, CH2, and CH3 domains. Also included in the invention are
any combinations of variable region(s) and hinge region, CH1, CH2, and CH3 domains. The
present invention further includes monoclonal, polyclonal, chimeric, humanized, and human
monoclonal and polyclonal antibodies which specifically bind the polypeptides of the
present invention. The present invention further includes antibodies which are anti-idiotypic
to the antibodies of the present invention.

The antibodies of the present invention may be monospecific, bispecific, trispecific
or of greater multispecificity. Multispecific antibodies may be specific for different epitopes
of a polypeptide of the present invention or may be specific for both a polypeptide of the
present invention as well as for heterologous compositions, such as a heterologous
polypeptide or solid support material. See, e.g., WO 93/17715; WO 92/08802; WO
91/00360; WO 92/05793; Tutt, A. et al. (1991) J. Immunol. 147:60-69; US Patents
5,573,920, 4,474,893, 5,601,819, 4,714,681, 4,925,648; Kostelny, S.A. et al. (1992) J.
Immunol. 148:1547-1553.

Antibodies of the present invention may be described or specified in terms of the
epitope(s) or portion(s) of a polypeptide of the present invention which are recognized or
specifically bound by the antibody. The epitope(s) or polypeptide portion(s) may be
specified as described herein, e.g., by N-terminal and C-terminal positions, by size in
contiguous amino acid residues, or listed in the Tables and Figures. Antibodies which
specifically bind any epitope or polypeptide of the present invention may also be excluded.
Therefore, the present invention includes antibodies that specifically bind polypeptides of
the present invention, and allows for the exclusion of the same.

Antibodies of the present invention may also be described or specified in terms of
their cross-reactivity. Antibodies that do not bind any other analog, ortholog, or homolog
of the polypeptides of the present invention are included. Antibodies that do not bind
polypeptides with less than 95%, less than 90%, less than 85%, less than 80%, less than
75%, less than 70%, less than 65%, less than 60%, less than 55%, and less than 50% identity
(as calculated using methods known in the art and described herein) to a polypeptide of the
present invention are also included in the present invention. Further included in the present
invention are antibodies which only bind polypeptides encoded by polynucleotides which
hybridize to a polynucleotide of the present invention under stringent hybridization
conditions (as described herein). Antibodies of the present invention may also be described
or specified in terms of their binding affinity. Preferred binding affinities include those with
a dissociation constant or Kd less than 5X10^-6 M, 10^-6 M, 5X10^-7 M, 10^-7 M, 5X10^-8 M,
10^-8 M, 5X10^-9 M, 10^-9 M, 5X10^-10 M, 10^-10 M, 5X10^-11 M, 10^-11 M, 5X10^-12 M,
10^-12 M, 5X10^-13 M, 10^-13 M, 5X10^-14 M, 10^-14 M, 5X10^-15 M, and 10^-15 M.

Antibodies of the present invention have uses that include, but are not limited to,
methods known in the art to purify, detect, and target the polypeptides of the present
invention including both in vitro and in vivo diagnostic and therapeutic methods. For
example, the antibodies have use in immunoassays for qualitatively and quantitatively
measuring levels of the polypeptides of the present invention in biological samples. See, e.g.,
Harlow et al., ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor
Laboratory Press, 2nd ed. 1988) (incorporated by reference in the entirety).

The antibodies of the present invention may be used either alone or in combination
with other compositions. The antibodies may further be recombinantly fused to a
heterologous polypeptide at the N- or C-terminus or chemically conjugated (including
covalent and non-covalent conjugations) to polypeptides or other compositions. For
example, antibodies of the present invention may be recombinantly fused or conjugated to
molecules useful as labels in detection assays and effector molecules such as heterologous
polypeptides, drugs, or toxins. See, e.g., WO 92/08495; WO 91/14438; WO 89/12624; US
Patent 5,314,995; and EP 0 396 387.

The antibodies of the present invention may be prepared by any suitable method
known in the art. For example, a polypeptide of the present invention or an antigenic
fragment thereof can be administered to an animal in order to induce the production of sera
containing polyclonal antibodies. The term "monoclonal antibody" is not limited to
antibodies produced through hybridoma technology. The term "monoclonal antibody" refers
to an antibody that is derived from a single clone, including eukaryotic, prokaryotic, or
phage clones, and not to the method by which it is produced. Monoclonal antibodies can be
prepared using a wide variety of techniques known in the art including the use of hybridoma,
recombinant and phage display technology.

Hybridoma techniques include those known in the art and taught in Harlow et al.,
ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory Press, 2nd
ed. 1988); Hammerling, et al., in: MONOCLONAL ANTIBODIES AND T-CELL
HYBRIDOMAS 563-681 (Elsevier, N.Y., 1981) (said references incorporated by reference
in their entireties). Fab and F(ab')2 fragments may be produced by proteolytic cleavage,
using enzymes such as papain (to produce Fab fragments) or pepsin (to produce F(ab')2
fragments).

Alternatively, antibodies of the present invention can be produced through the
application of recombinant DNA and phage display technology or through synthetic
chemistry using methods known in the art. For example, the antibodies of the present
invention can be prepared using various phage display methods known in the art. In phage
display methods, functional antibody domains are displayed on the surface of a phage
particle which carries polynucleotide sequences encoding them. Phage with a desired
binding property are selected from a repertoire or combinatorial antibody library (e.g. human
or murine) by selecting directly with antigen, typically antigen bound or captured to a solid
surface or bead. Phage used in these methods are typically filamentous phage including fd
and M13 with Fab, Fv or disulfide stabilized Fv antibody domains recombinantly fused to
either the phage gene III or gene VIII protein. Examples of phage display methods that can
be used to make the antibodies of the present invention include those disclosed in Brinkman
U. et al. (1995) J. Immunol. Methods 182:41-50; Ames, R.S. et al. (1995) J. Immunol.
Methods 184:177-186; Kettleborough, C.A. et al. (1994) Eur. J. Immunol. 24:952-958;
Persic, L. et al. (1997) Gene 187:9-18; Burton, D.R. et al. (1994) Advances in Immunology
57:191-280; PCT/GB91/01134; WO 90/02809; WO 91/10737; WO 92/01047; WO
92/18619; WO 93/11236; WO 95/15982; WO 95/20401; and US Patents 5,698,426,
5,223,409, 5,403,484, 5,580,717, 5,427,908, 5,750,753, 5,821,047, 5,571,698, 5,427,908,
5,516,637, 5,780,225, 5,658,727 and 5,733,743 (said references incorporated by reference
in their entireties).

As described in the above references, after phage selection, the antibody coding
regions from the phage can be isolated and used to generate whole antibodies, including
human antibodies, or any other desired antigen binding fragment, and expressed in any
desired host including mammalian cells, insect cells, plant cells, yeast, and bacteria. For
example, techniques to recombinantly produce Fab, Fab' and F(ab')2 fragments can also be
employed using methods known in the art such as those disclosed in WO 92/22324;
Mullinax, R.L. et al. (1992) BioTechniques 12(6):864-869; Sawai, H. et al. (1995) AJRI
34:26-34; and Better, M. et al. (1988) Science 240:1041-1043 (said references incorporated
by reference in their entireties).

Examples of techniques which can be used to produce single-chain Fvs and
antibodies include those described in U.S. Patents 4,946,778 and 5,258,498; Huston et al.
(1991) Methods in Enzymology 203:46-88; Shu, L. et al. (1993) PNAS 90:7995-7999; and
Skerra, A. et al. (1988) Science 240:1038-1040. For some uses, including in vivo use of
antibodies in humans and in vitro detection assays, it may be preferable to use chimeric,
humanized, or human antibodies. Methods for producing chimeric antibodies are known in
the art. See, e.g., Morrison, Science 229:1202 (1985); Oi et al., BioTechniques 4:214 (1986);
Gillies, S.D. et al. (1989) J. Immunol. Methods 125:191-202; and US Patent 5,807,715.
Antibodies can be humanized using a variety of techniques including CDR-grafting (EP 0
239 400; WO 91/09967; US Patents 5,530,101 and 5,585,089), veneering or resurfacing (EP
0 592 106; EP 0 519 596; Padlan, E.A. (1991) Molecular Immunology 28(4/5):489-498;
Studnicka, G.M. et al. (1994) Protein Engineering 7(6):805-814; Roguska, M.A. et al. (1994)
PNAS 91:969-973), and chain shuffling (US Patent 5,565,332). Human antibodies can be
made by a variety of methods known in the art including phage display methods described
above. See also, US Patents 4,444,887, 4,716,111, 5,545,806, and 5,814,318; and WO
98/46645 (said references incorporated by reference in their entireties).

Further included in the present invention are antibodies recombinantly fused or
chemically conjugated (including both covalent and non-covalent conjugations) to a
polypeptide of the present invention. The antibodies may be specific for antigens other than
polypeptides of the present invention. For example, antibodies may be used to target the
polypeptides of the present invention to particular cell types, either in vitro or in vivo, by
fusing or conjugating the polypeptides of the present invention to antibodies specific for
particular cell surface receptors. Antibodies fused or conjugated to the polypeptides of the
present invention may also be used in in vitro immunoassays and purification methods
using methods known in the art. See, e.g., Harlow et al., supra, and WO 93/21232; EP 0 439
095; Naramura, M. et al. (1994) Immunol. Lett. 39:91-99; US Patent 5,474,981; Gillies, S.O.
et al. (1992) PNAS 89:1428-1432; Fell, H.P. et al. (1991) J. Immunol. 146:2446-2452 (said
references incorporated by reference in their entireties).

The present invention further includes compositions comprising the polypeptides of
the present invention fused or conjugated to antibody domains other than the variable
regions. For example, the polypeptides of the present invention may be fused or conjugated
to an antibody Fc region, or portion thereof. The antibody portion fused to a polypeptide of
the present invention may comprise the hinge region, CH1 domain, CH2 domain, and CH3
domain or any combination of whole domains or portions thereof. The polypeptides of the
present invention may be fused or conjugated to the above antibody portions to increase the
in vivo half life of the polypeptides or for use in immunoassays using methods known in the
art. The polypeptides may also be fused or conjugated to the above antibody portions to
form multimers. For example, Fc portions fused to the polypeptides of the present invention
can form dimers through disulfide bonding between the Fc portions. Higher multimeric
forms can be made by fusing the polypeptides to portions of IgA and IgM. Methods for
fusing or conjugating the polypeptides of the present invention to antibody portions are
known in the art. See, e.g., US Patents 5,336,603, 5,622,929, 5,359,046, 5,349,053,
5,447,851, 5,112,946; EP 0 307 434, EP 0 367 166; WO 96/04388, WO 91/06570;
Ashkenazi, A. et al. (1991) PNAS 88:10535-10539; Zheng, X.X. et al. (1995) J. Immunol.
154:5590-5600; and Vil, H. et al. (1992) PNAS 89:11337-11341 (said references
incorporated by reference in their entireties).

The invention further relates to antibodies that act as agonists or antagonists of the
polypeptides of the present invention. Antibodies which act as agonists or antagonists of the
polypeptides of the present invention include, for example, antibodies which disrupt
receptor/ligand interactions with the polypeptides of the invention either partially or fully.
For example, the present invention includes antibodies that disrupt the ability of the proteins
of the invention to multimerize. In another example, the present invention includes
antibodies which allow the proteins of the invention to multimerize, but disrupt the ability
of the proteins of the invention to bind one or more METH1 and/or METH2
receptor(s)/ligand(s). In yet another example, the present invention includes antibodies
which allow the proteins of the invention to multimerize, and bind METH1 and/or METH2
receptor(s)/ligand(s), but block biological activity associated with the METH1 and/or
METH2/receptor/ligand complex.

Antibodies which act as agonists or antagonists of the polypeptides of the present 
invention also include both receptor-specific antibodies and ligand-specific antibodies.
Included are receptor-specific antibodies that do not prevent ligand binding but prevent
receptor activation. Receptor activation (i.e., signaling) may be determined by techniques
described herein or otherwise known in the art. Also included are receptor-specific
antibodies which prevent both ligand binding and receptor activation. Likewise, included
are neutralizing antibodies which bind the ligand and prevent binding of the ligand to the 
receptor, as well as antibodies which bind the ligand, thereby preventing receptor activation, 
but do not prevent the ligand from binding the receptor. Further included are antibodies that 
activate the receptor. These antibodies may act as agonists for either all or less than all of 
the biological activities affected by ligand-mediated receptor activation. The antibodies may 
be specified as agonists or antagonists for biological activities comprising specific activities 
disclosed herein. The above antibody agonists can be made using methods known in the art. 
See, e.g., WO 96/40281; US Patent Number 5,811,097; Deng, B. et al., Blood 92(6):1981-1988
(1998); Chen, Z. et al., Cancer Res. 58(16):3668-3678 (1998); Harrop, J.A. et al., J.
Immunol. 161(4):1786-1794 (1998); Zhu, Z. et al., Cancer Res. 58(15):3209-3214 (1998);
Yoon, D.Y. et al., J. Immunol. 160(7):3170-3179 (1998); Prat, M. et al., J. Cell Sci.
111(Pt2):237-247 (1998); Pitard, V. et al., J. Immunol. Methods 205(2):177-190 (1997);
Liautard, J. et al., Cytokine 9(4):233-241 (1997); Carlson, N.G. et al., J. Biol. Chem.
272(17):11295-11301 (1997); Taryman, R.E. et al., Neuron 14(4):755-762 (1995); Muller,
Y.A. et al., Structure 6(9):1153-1167 (1998); Bartunek, P. et al., Cytokine 8(1):14-20
(1996) (said references incorporated by reference in their entireties).

As discussed above, antibodies to the METH1 and/or METH2 proteins of the
invention can, in turn, be utilized to generate anti-idiotype antibodies that "mimic" METH1
and/or METH2 using techniques well known to those skilled in the art. (See, e.g., Greenspan
& Bona, FASEB J. 7(5):437-444 (1989) and Nissinoff, J. Immunol. 147(8):2429-2438
(1991)). For example, antibodies which bind to METH1 and/or METH2 and competitively
inhibit METH1 and/or METH2 multimerization and/or binding to ligand can be used to
generate anti-idiotypes that "mimic" the METH1 and/or METH2 multimerization and/or
binding domain and, as a consequence, bind to and neutralize METH1 and/or METH2
and/or its ligand. Such neutralizing anti-idiotypes or Fab fragments of such anti-idiotypes
can be used in therapeutic regimens to neutralize METH1 and/or METH2 ligand. For
example, such anti-idiotypic antibodies can be used to bind METH1 and/or METH2, or to
bind METH1 and/or METH2 ligands/receptors, and thereby block METH1 and/or METH2
biological activity.



Fusion Proteins 

Any METH1 or METH2 polypeptide can be used to generate fusion proteins. For
example, the METH1 or METH2 polypeptide, when fused to a second protein, can be used
as an antigenic tag. Antibodies raised against the METH1 or METH2 polypeptide can be
used to indirectly detect the second protein by binding to the METH1 or METH2.
Moreover, because secreted proteins target cellular locations based on trafficking signals,
the METH1 or METH2 polypeptides can be used as a targeting molecule once fused to other
proteins.

Examples of domains that can be fused to METH1 or METH2 polypeptides include
not only heterologous signal sequences, but also other heterologous functional regions. The
fusion does not necessarily need to be direct, but may occur through linker sequences.

In certain preferred embodiments, METH1 or METH2 proteins of the invention
comprise fusion proteins wherein the METH1 or METH2 polypeptides are those described
above as m1-n1 or m2-n2, respectively. In preferred embodiments, the application is directed
to nucleic acid molecules at least 90%, 95%, 96%, 97%, 98% or 99% identical to the nucleic
acid sequences encoding polypeptides having the amino acid sequence of the specific N- and
C-terminal deletions recited herein. Polynucleotides encoding these polypeptides are also
encompassed by the invention.
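
The identity thresholds recited above (at least 90% through 99%) can be illustrated with a small calculation. The following is only a sketch and is not the specification's own definition of percent identity: it assumes the two nucleic acid sequences have already been globally aligned to equal length, that "-" marks a gap, and that any column containing a gap counts as a mismatch. The function names `percent_identity` and `meets_identity_threshold` are hypothetical and chosen for illustration.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percentage of alignment columns in which both sequences carry the same base.

    Assumption: both inputs are pre-aligned (equal length, '-' marks a gap).
    Columns where either sequence has a gap are counted as mismatches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    matches = sum(
        1
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if a == b and a != "-"
    )
    return 100.0 * matches / len(aligned_a)


def meets_identity_threshold(aligned_a: str, aligned_b: str, threshold: float = 90.0) -> bool:
    """True if the pre-aligned sequences are at least `threshold` percent identical."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

Under these assumptions, a query that differs from a 10-base reference at a single position would be 90% identical to it, and so would satisfy the 90% threshold but not the 95% threshold. Other conventions (for example, excluding gap columns from the denominator) give different values, so the convention used should always be stated.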

Moreover, fusion proteins may also be engineered to improve characteristics of the
METH1 or METH2 polypeptide. For instance, a region of additional amino acids,
particularly charged amino acids, may be added to the N-terminus of the METH1 or METH2
polypeptide to improve stability and persistence during purification from the host cell or
subsequent handling and storage. Also, peptide moieties may be added to the METH1 or
METH2 polypeptide to facilitate purification. Such regions may be removed prior to final
preparation of the METH1 or METH2 polypeptide. The addition of peptide moieties to
facilitate handling of polypeptides is a familiar and routine technique in the art.

Moreover, METH1 or METH2 polypeptides, including fragments, and specifically
epitopes, can be combined with parts of the constant domain of immunoglobulins (IgG),
resulting in chimeric polypeptides. These fusion proteins facilitate purification and show
an increased half-life in vivo. One reported example describes chimeric proteins consisting
of the first two domains of the human CD4-polypeptide and various domains of the constant
regions of the heavy or light chains of mammalian immunoglobulins. (EP A 394,827;
Traunecker et al., Nature 331:84-86 (1988).) Fusion proteins having disulfide-linked
dimeric structures (due to the IgG) can also be more efficient in binding and neutralizing
other molecules than the monomeric secreted protein or protein fragment alone.
(Fountoulakis et al., J. Biochem. 270:3958-3964 (1995).)

Similarly, EP-A-0 464 533 (Canadian counterpart 2045869) discloses fusion
proteins comprising various portions of constant region of immunoglobulin molecules
together with another human protein or part thereof. In many cases, the Fc part in a fusion
protein is beneficial in therapy and diagnosis, and thus can result in, for example, improved
pharmacokinetic properties. (EP-A 0232 262.) Alternatively, deleting the Fc part after the
fusion protein has been expressed, detected, and purified, would be desired. For example,
the Fc portion may hinder therapy and diagnosis if the fusion protein is used as an antigen
for immunizations. In drug discovery, for example, human proteins, such as hIL-5, have
been fused with Fc portions for the purpose of high-throughput screening assays to identify
antagonists of hIL-5. (See, D. Bennett et al., J. Molecular Recognition 8:52-58 (1995); K.
Johanson et al., J. Biol. Chem. 270:9459-9471 (1995).)

Moreover, the METH1 or METH2 polypeptides can be fused to marker sequences,
such as a peptide which facilitates purification of METH1 or METH2. In preferred
embodiments, the marker amino acid sequence is a hexa-histidine peptide, such as the tag
provided in a pQE vector (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 91311),
among others, many of which are commercially available. As described in Gentz et al.,
Proc. Natl. Acad. Sci. USA 86:821-824 (1989), for instance, hexa-histidine provides for
convenient purification of the fusion protein. Another peptide tag useful for purification,
the "HA" tag, corresponds to an epitope derived from the influenza hemagglutinin protein.
(Wilson et al., Cell 37:767 (1984).)

Thus, any of these above fusions can be engineered using the METH1 or METH2
polynucleotides or the polypeptides.



Biological Activities of METH1 and/or METH2

METH1 and/or METH2 polynucleotides or polypeptides, or agonists or antagonists
of METH1 and/or METH2, can be used in assays to test for one or more biological
activities. If METH1 and/or METH2 polynucleotides or polypeptides, or agonists or
antagonists of METH1 and/or METH2, do exhibit activity in a particular assay, it is likely
that METH1 and/or METH2 may be involved in the diseases associated with the biological
activity. Therefore, METH1 and/or METH2 could be used to treat the associated disease.



Immune Activity 

METH1 and/or METH2 polynucleotides or polypeptides, or agonists or antagonists
of METH1 and/or METH2, may be useful in treating deficiencies or disorders of the
immune system, by activating or inhibiting the proliferation, differentiation, or mobilization
(chemotaxis) of immune cells. Immune cells develop through a process called
hematopoiesis, producing myeloid (platelets, red blood cells, neutrophils, and macrophages)
and lymphoid (B and T lymphocytes) cells from pluripotent stem cells. The etiology of
these immune deficiencies or disorders may be genetic, somatic, such as cancer or some
autoimmune disorders, acquired (e.g., by chemotherapy or toxins), or infectious. Moreover,
METH1 and/or METH2 polynucleotides or polypeptides, or agonists or antagonists of
METH1 and/or METH2, can be used as a marker or detector of a particular immune system
disease or disorder.

METH1 and/or METH2 polynucleotides or polypeptides, or agonists or antagonists
of METH1 and/or METH2, may be useful in treating or detecting deficiencies or disorders
of hematopoietic cells. METH1 and/or METH2 polynucleotides or polypeptides, or agonists
or antagonists of METH1 and/or METH2, could be used to increase differentiation and
proliferation of hematopoietic cells, including the pluripotent stem cells, in an effort to treat
those disorders associated with a decrease in certain (or many) types of hematopoietic cells.
Examples of immunologic deficiency syndromes include, but are not limited to: blood
protein disorders (e.g. agammaglobulinemia, dysgammaglobulinemia), ataxia telangiectasia,
common variable immunodeficiency, DiGeorge Syndrome, HIV infection, HTLV-BLV
infection, leukocyte adhesion deficiency syndrome, lymphopenia, phagocyte bactericidal
dysfunction, severe combined immunodeficiency (SCIDs), Wiskott-Aldrich Disorder,
anemia, thrombocytopenia, or hemoglobinuria.

Moreover, METH1 and/or METH2 polynucleotides or polypeptides, or agonists or
antagonists of METH1 and/or METH2, can also be used to modulate hemostatic (the
stopping of bleeding) or thrombolytic activity (clot formation). For example, by increasing
hemostatic or thrombolytic activity, METH1 and/or METH2 polynucleotides or
polypeptides, or agonists or antagonists of METH1 and/or METH2, could be used to treat
blood coagulation disorders (e.g., afibrinogenemia, factor deficiencies), blood platelet
disorders (e.g. thrombocytopenia), or wounds resulting from trauma, surgery, or other
causes. Alternatively, METH1 and/or METH2 polynucleotides or polypeptides, or agonists
or antagonists of METH1 and/or METH2, that can decrease hemostatic or thrombolytic
activity could be used to inhibit or dissolve clotting, important in the treatment of heart
attacks (infarction), strokes, or scarring.

METH1 and/or METH2 polynucleotides or polypeptides, or agonists or antagonists
of METH1 and/or METH2, may also be useful in treating or detecting autoimmune
disorders. Many autoimmune disorders result from inappropriate recognition of self as
foreign material by immune cells. This inappropriate recognition results in an immune
response leading to the destruction of the host tissue. Therefore, the administration of
METH1 and/or METH2 polynucleotides or polypeptides, or agonists or antagonists of
METH1 and/or METH2, that can inhibit an immune response, particularly the proliferation,
differentiation, or chemotaxis of T-cells, may be an effective therapy in preventing
autoimmune disorders.

Examples of autoimmune disorders that can be treated or detected include, but are
not limited to: Addison's Disease, hemolytic anemia, antiphospholipid syndrome,
rheumatoid arthritis, dermatitis, allergic encephalomyelitis, glomerulonephritis,
Goodpasture's Syndrome, Graves' Disease, Multiple Sclerosis, Myasthenia Gravis, Neuritis,
Ophthalmia, Bullous Pemphigoid, Pemphigus, Polyendocrinopathies, Purpura, Reiter's
Disease, Stiff-Man Syndrome, Autoimmune Thyroiditis, Systemic Lupus Erythematosus,
Autoimmune Pulmonary Inflammation, Guillain-Barre Syndrome, insulin dependent
diabetes mellitus, and autoimmune inflammatory eye disease.

Similarly, allergic reactions and conditions, such as asthma (particularly allergic 
asthma) or other respiratory problems, may also be treated by METH1 and/or METH2
polynucleotides or polypeptides, or agonists or antagonists of METH1 and/or METH2.
Moreover, these molecules can be used to treat anaphylaxis, hypersensitivity to an antigenic 
molecule, or blood group incompatibility. 

METH1 and/or METH2 polynucleotides or polypeptides, or agonists or antagonists
of METH1 and/or METH2, may also be used to treat and/or prevent organ rejection or
graft-versus-host disease (GVHD). Organ rejection occurs by host immune cell destruction of the
transplanted tissue through an immune response. Similarly, an immune response is also
involved in GVHD, but, in this case, the foreign transplanted immune cells destroy the host
tissues. The administration of METH1 and/or METH2 polynucleotides or polypeptides, or
agonists or antagonists of METH1 and/or METH2, that inhibits an immune response,
particularly the proliferation, differentiation, or chemotaxis of T-cells, may be an effective
therapy in preventing organ rejection or GVHD.

Similarly, METH1 and/or METH2 polynucleotides or polypeptides, or agonists or
antagonists of METH1 and/or METH2, may also be used to modulate inflammation. For
example, METH1 and/or METH2 polynucleotides or polypeptides, or agonists or
antagonists of METH1 and/or METH2, may inhibit the proliferation and differentiation of
cells involved in an inflammatory response. These molecules can be used to treat
inflammatory conditions, both chronic and acute conditions, including inflammation
associated with infection (e.g., septic shock, sepsis, or systemic inflammatory response
syndrome (SIRS)), ischemia-reperfusion injury, endotoxin lethality, arthritis,
complement-mediated hyperacute rejection, nephritis, cytokine or chemokine induced lung injury,
inflammatory bowel disease, Crohn's disease, or resulting from over production of cytokines
(e.g., TNF or IL-1).

Hyperproliferative Disorders 

METH1 and/or METH2 polynucleotides or polypeptides, or agonists or antagonists
of METH1 and/or METH2, can be used to treat or detect hyperproliferative disorders,
including neoplasms. METH1 and/or METH2 polynucleotides or polypeptides, or agonists
or antagonists of METH1 and/or METH2, may inhibit the proliferation of the disorder
through direct or indirect interactions. Alternatively, METH1 and/or METH2
polynucleotides or polypeptides, or agonists or antagonists of METH1 and/or METH2, may
proliferate other cells which can inhibit the hyperproliferative disorder.

For example, by increasing an immune response, particularly increasing antigenic
qualities of the hyperproliferative disorder or by proliferating, differentiating, or mobilizing
T-cells, hyperproliferative disorders can be treated. This immune response may be increased
by either enhancing an existing immune response, or by initiating a new immune response.
Alternatively, decreasing an immune response may also be a method of treating
hyperproliferative disorders, such as with a chemotherapeutic agent.

Examples of hyperproliferative disorders that can be treated or detected by METH1
and/or METH2 polynucleotides or polypeptides, or agonists or antagonists of METH1
and/or METH2, include, but are not limited to neoplasms located in the: abdomen, bone,
breast, digestive system, liver, pancreas, peritoneum, endocrine glands (adrenal, parathyroid,
pituitary, testicles, ovary, thymus, thyroid), eye, head and neck, nervous (central and
peripheral), lymphatic system, pelvic, skin, soft tissue, spleen, thoracic, and urogenital.

Similarly, other hyperproliferative disorders can also be treated or detected by
METH1 and/or METH2 polynucleotides or polypeptides, or agonists or antagonists of
METH1 and/or METH2. Examples of such hyperproliferative disorders include, but are not
limited to: hypergammaglobulinemia, lymphoproliferative disorders, paraproteinemias,
purpura, sarcoidosis, Sezary Syndrome, Waldenstrom's Macroglobulinemia, Gaucher's
Disease, histiocytosis, and any other hyperproliferative disease, besides neoplasia, located
in an organ system listed above.

Cardiovascular Disorders 

METH1 and/or METH2 polynucleotides or polypeptides, or agonists or antagonists
of METH1 and/or METH2, encoding METH1 and/or METH2 may be used to treat
cardiovascular disorders, including peripheral artery disease, such as limb ischemia.

Cardiovascular disorders include cardiovascular abnormalities, such as
arterio-arterial fistula, arteriovenous fistula, cerebral arteriovenous malformations, congenital heart
defects, pulmonary atresia, and Scimitar Syndrome. Congenital heart defects include aortic
coarctation, cor triatriatum, coronary vessel anomalies, crisscross heart, dextrocardia, patent
ductus arteriosus, Ebstein's anomaly, Eisenmenger complex, hypoplastic left heart syndrome,
levocardia, tetralogy of Fallot, transposition of great vessels, double outlet right ventricle,
tricuspid atresia, persistent truncus arteriosus, and heart septal defects, such as
aortopulmonary septal defect, endocardial cushion defects, Lutembacher's Syndrome, trilogy
of Fallot, and ventricular heart septal defects.

Cardiovascular disorders also include heart disease, such as arrhythmias, carcinoid
heart disease, high cardiac output, low cardiac output, cardiac tamponade, endocarditis
(including bacterial), heart aneurysm, cardiac arrest, congestive heart failure, congestive
cardiomyopathy, paroxysmal dyspnea, cardiac edema, heart hypertrophy, congestive
cardiomyopathy, left ventricular hypertrophy, right ventricular hypertrophy, post-infarction
heart rupture, ventricular septal rupture, heart valve diseases, myocardial diseases,
myocardial ischemia, pericardial effusion, pericarditis (including constrictive and
tuberculous), pneumopericardium, postpericardiotomy syndrome, pulmonary heart disease,
rheumatic heart disease, ventricular dysfunction, hyperemia, cardiovascular pregnancy
complications, Scimitar Syndrome, cardiovascular syphilis, and cardiovascular tuberculosis.

Arrhythmias include sinus arrhythmia, atrial fibrillation, atrial flutter, bradycardia,
extrasystole, Adams-Stokes Syndrome, bundle-branch block, sinoatrial block, long QT
syndrome, parasystole, Lown-Ganong-Levine Syndrome, Mahaim-type pre-excitation
syndrome, Wolff-Parkinson-White syndrome, sick sinus syndrome, tachycardias, and
ventricular fibrillation. Tachycardias include paroxysmal tachycardia, supraventricular
tachycardia, accelerated idioventricular rhythm, atrioventricular nodal reentry tachycardia,
ectopic atrial tachycardia, ectopic junctional tachycardia, sinoatrial nodal reentry
tachycardia, sinus tachycardia, Torsades de Pointes, and ventricular tachycardia.

Heart valve diseases include aortic valve insufficiency, aortic valve stenosis, heart
murmurs, aortic valve prolapse, mitral valve prolapse, tricuspid valve prolapse, mitral valve
insufficiency, mitral valve stenosis, pulmonary atresia, pulmonary valve insufficiency, 
pulmonary valve stenosis, tricuspid atresia, tricuspid valve insufficiency, and tricuspid valve 
stenosis. 

Myocardial diseases include alcoholic cardiomyopathy, congestive cardiomyopathy, 
hypertrophic cardiomyopathy, aortic subvalvular stenosis, pulmonary subvalvular stenosis, 
restrictive cardiomyopathy, Chagas cardiomyopathy, endocardial fibroelastosis, 
endomyocardial fibrosis, Kearns Syndrome, myocardial reperfusion injury, and myocarditis. 

Myocardial ischemias include coronary disease, such as angina pectoris, coronary 
aneurysm, coronary arteriosclerosis, coronary thrombosis, coronary vasospasm, myocardial 
infarction and myocardial stunning. 

Cardiovascular diseases also include vascular diseases such as aneurysms,
angiodysplasia, angiomatosis, bacillary angiomatosis, Hippel-Lindau Disease,
Klippel-Trenaunay-Weber Syndrome, Sturge-Weber Syndrome, angioneurotic edema, aortic
diseases, Takayasu's Arteritis, aortitis, Leriche's Syndrome, arterial occlusive diseases,
arteritis, enarteritis, polyarteritis nodosa, cerebrovascular disorders, diabetic angiopathies,
diabetic retinopathy, embolisms, thrombosis, erythromelalgia, hemorrhoids, hepatic
veno-occlusive disease, hypertension, hypotension, ischemia, peripheral vascular diseases,
phlebitis, pulmonary veno-occlusive disease, Raynaud's disease, CREST syndrome, retinal
vein occlusion, Scimitar syndrome, superior vena cava syndrome, telangiectasia, ataxia
telangiectasia, hereditary hemorrhagic telangiectasia, varicocele, varicose veins, varicose
ulcer, vasculitis, and venous insufficiency.

Aneurysms include dissecting aneurysms, false aneurysms, infected aneurysms, 
ruptured aneurysms, aortic aneurysms, cerebral aneurysms, coronary aneurysms, heart 
aneurysms, and iliac aneurysms. 

Arterial occlusive diseases include arteriosclerosis, intermittent claudication, carotid 
stenosis, fibromuscular dysplasias, mesenteric vascular occlusion, Moyamoya disease, renal 
artery obstruction, retinal artery occlusion, and thromboangiitis obliterans.

Cerebrovascular disorders include carotid artery diseases, cerebral amyloid
angiopathy, cerebral aneurysm, cerebral anoxia, cerebral arteriosclerosis, cerebral
arteriovenous malformation, cerebral artery diseases, cerebral embolism and thrombosis,
carotid artery thrombosis, sinus thrombosis, Wallenberg's syndrome, cerebral hemorrhage,
epidural hematoma, subdural hematoma, subarachnoid hemorrhage, cerebral infarction,
cerebral ischemia (including transient), subclavian steal syndrome, periventricular
leukomalacia, vascular headache, cluster headache, migraine, and vertebrobasilar
insufficiency.

Embolisms include air embolisms, amniotic fluid embolisms, cholesterol embolisms,
blue toe syndrome, fat embolisms, pulmonary embolisms, and thromboembolisms.
Thromboses include coronary thrombosis, hepatic vein thrombosis, retinal vein occlusion,
carotid artery thrombosis, sinus thrombosis, Wallenberg's syndrome, and thrombophlebitis.

Ischemia includes cerebral ischemia, ischemic colitis, compartment syndromes,
anterior compartment syndrome, myocardial ischemia, reperfusion injuries, and peripheral
limb ischemia. Vasculitis includes aortitis, arteritis, Behcet's Syndrome, Churg-Strauss
Syndrome, mucocutaneous lymph node syndrome, thromboangiitis obliterans,
hypersensitivity vasculitis, Schoenlein-Henoch purpura, allergic cutaneous vasculitis, and
Wegener's granulomatosis.

METH1 and/or METH2 polynucleotides or polypeptides, or agonists or antagonists
of METH1 and/or METH2, are especially effective for the treatment of critical limb
ischemia and coronary disease.

METH1 and/or METH2 polypeptides may be administered using any method known
in the art, including, but not limited to, direct needle injection at the delivery site,
intravenous injection, topical administration, catheter infusion, biolistic injectors, particle
accelerators, gelfoam sponge depots, other commercially available depot materials, osmotic
pumps, oral or suppositorial solid pharmaceutical formulations, decanting or topical
applications during surgery, and aerosol delivery. Such methods are known in the art. METH1
and/or METH2 polypeptides may be administered as part of a pharmaceutical composition,
described in more detail below. Methods of delivering METH1 and/or METH2
polynucleotides are described in more detail herein.

Diseases at the Cellular Level

Diseases associated with increased cell survival or the inhibition of apoptosis that
could be treated or detected by METH1 and/or METH2 polynucleotides or polypeptides, as
well as antagonists or agonists of METH1 and/or METH2, include cancers (such as
follicular lymphomas, carcinomas with p53 mutations, and hormone-dependent tumors,
including, but not limited to colon cancer, cardiac tumors, pancreatic cancer, melanoma,
retinoblastoma, glioblastoma, lung cancer, intestinal cancer, testicular cancer, stomach
cancer, neuroblastoma, myxoma, myoma, lymphoma, endothelioma, osteoblastoma,
osteoclastoma, osteosarcoma, chondrosarcoma, adenoma, breast cancer, prostate cancer,
Kaposi's sarcoma and ovarian cancer); autoimmune disorders (such as multiple sclerosis,
Sjogren's syndrome, Hashimoto's thyroiditis, biliary cirrhosis, Behcet's disease, Crohn's
disease, polymyositis, systemic lupus erythematosus and immune-related glomerulonephritis
and rheumatoid arthritis) and viral infections (such as herpes viruses, pox viruses and
adenoviruses), inflammation, graft v. host disease, acute graft rejection, and chronic graft
rejection. In preferred embodiments, METH1 and/or METH2 polynucleotides, polypeptides,
and/or antagonists of the invention are used to inhibit growth, progression, and/or metastasis
of cancers, in particular those listed above.

Additional diseases or conditions associated with increased cell survival that could
be treated or detected by METH1 and/or METH2 polynucleotides or polypeptides, or
agonists or antagonists of METH1 and/or METH2, include, but are not limited to,
progression, and/or metastases of malignancies and related disorders such as leukemia
(including acute leukemias (e.g., acute lymphocytic leukemia, acute myelocytic leukemia
(including myeloblastic, promyelocytic, myelomonocytic, monocytic, and erythroleukemia))
and chronic leukemias (e.g., chronic myelocytic (granulocytic) leukemia and chronic
lymphocytic leukemia)), polycythemia vera, lymphomas (e.g., Hodgkin's disease and
non-Hodgkin's disease), multiple myeloma, Waldenstrom's macroglobulinemia, heavy chain
disease, and solid tumors including, but not limited to, sarcomas and carcinomas such as
fibrosarcoma, myxosarcoma, liposarcoma, chondrosarcoma, osteogenic sarcoma, chordoma,
angiosarcoma, endotheliosarcoma, lymphangiosarcoma, lymphangioendotheliosarcoma,
synovioma, mesothelioma, Ewing's tumor, leiomyosarcoma, rhabdomyosarcoma, colon
carcinoma, pancreatic cancer, breast cancer, ovarian cancer, prostate cancer, squamous cell
carcinoma, basal cell carcinoma, adenocarcinoma, sweat gland carcinoma, sebaceous gland
carcinoma, papillary carcinoma, papillary adenocarcinomas, cystadenocarcinoma, medullary
carcinoma, bronchogenic carcinoma, renal cell carcinoma, hepatoma, bile duct carcinoma,
choriocarcinoma, seminoma, embryonal carcinoma, Wilms' tumor, cervical cancer, testicular
tumor, lung carcinoma, small cell lung carcinoma, bladder carcinoma, epithelial carcinoma,
glioma, astrocytoma, medulloblastoma, craniopharyngioma, ependymoma, pinealoma,
hemangioblastoma, acoustic neuroma, oligodendroglioma, meningioma, melanoma,
neuroblastoma, and retinoblastoma.

Diseases associated with increased apoptosis that could be treated or detected by
METH1 and/or METH2 polynucleotides or polypeptides, as well as agonists or antagonists
of METH1 and/or METH2, include AIDS; neurodegenerative disorders (such as
Alzheimer's disease, Parkinson's disease, amyotrophic lateral sclerosis, retinitis
pigmentosa, cerebellar degeneration and brain tumor or prior associated disease);
autoimmune disorders (such as multiple sclerosis, Sjogren's syndrome, Hashimoto's
thyroiditis, biliary cirrhosis, Behcet's disease, Crohn's disease, polymyositis, systemic lupus
erythematosus and immune-related glomerulonephritis and rheumatoid arthritis);
myelodysplastic syndromes (such as aplastic anemia), graft v. host disease, ischemic injury
(such as that caused by myocardial infarction, stroke and reperfusion injury), liver injury
(e.g., hepatitis related liver injury, ischemia/reperfusion injury, cholestasis (bile duct injury)
and liver cancer); toxin-induced liver disease (such as that caused by alcohol), septic shock,
cachexia and anorexia.



Wound Healing and Epithelial Cell Proliferation 

In accordance with yet a further aspect of the present invention, there is provided a
process for utilizing METH1 and/or METH2 polynucleotides or polypeptides, as well as
agonists or antagonists of METH1 and/or METH2, for therapeutic purposes, for example,
to stimulate epithelial cell proliferation and basal keratinocytes for the purpose of wound
healing, and to stimulate hair follicle production and healing of dermal wounds. METH1
and/or METH2 polynucleotides or polypeptides, as well as agonists or antagonists of
METH1 and/or METH2, may be clinically useful in stimulating wound healing including
surgical wounds, excisional wounds, deep wounds involving damage of the dermis and
epidermis, eye tissue wounds, dental tissue wounds, oral cavity wounds, diabetic ulcers,
dermal ulcers, cubitus ulcers, arterial ulcers, venous stasis ulcers, burns resulting from heat
exposure or chemicals, and other abnormal wound healing conditions such as uremia,
malnutrition, vitamin deficiencies and complications associated with systemic treatment with
steroids, radiation therapy and antineoplastic drugs and antimetabolites. METH1 and/or
METH2 polynucleotides or polypeptides, as well as agonists or antagonists of METH1
and/or METH2, could be used to promote dermal reestablishment subsequent to dermal loss.

METH1 and/or METH2 polynucleotides or polypeptides, as well as agonists or
antagonists of METH1 and/or METH2, could be used to increase the adherence of skin
grafts to a wound bed and to stimulate re-epithelialization from the wound bed. The
following are types of grafts that METH1 and/or METH2 polynucleotides or polypeptides,
agonists or antagonists of METH1 and/or METH2, could be used to increase adherence to
a wound bed: autografts, artificial skin, allografts, autodermic graft, autoepidermic grafts,
avascular grafts, Blair-Brown grafts, bone graft, brephoplastic grafts, cutis graft, delayed
graft, dermic graft, epidermic graft, fascia graft, full thickness graft, heterologous graft,
xenograft, homologous graft, hyperplastic graft, lamellar graft, mesh graft, mucosal graft,
Ollier-Thiersch graft, omental graft, patch graft, pedicle graft, penetrating graft, split skin
graft, and thick split graft. METH1 and/or METH2 polynucleotides or polypeptides, as well as
agonists or antagonists of METH1 and/or METH2, can be used to promote skin strength and
to improve the appearance of aged skin.

It is believed that METH1 and/or METH2 polynucleotides or polypeptides, as well
as agonists or antagonists of METH1 and/or METH2, will also produce changes in
hepatocyte proliferation, and epithelial cell proliferation in the lung, breast, pancreas,
stomach, small intestine, and large intestine. METH1 and/or METH2 polynucleotides or
polypeptides, as well as agonists or antagonists of METH1 and/or METH2, could promote
proliferation of epithelial cells such as sebocytes, hair follicles, hepatocytes, type II
pneumocytes, mucin-producing goblet cells, and other epithelial cells and their progenitors
contained within the skin, lung, liver, and gastrointestinal tract. METH1 and/or METH2
polynucleotides or polypeptides, agonists or antagonists of METH1 and/or METH2, may
promote proliferation of endothelial cells, keratinocytes, and basal keratinocytes.

METH1 and/or METH2 polynucleotides or polypeptides, as well as agonists or
antagonists of METH1 and/or METH2, could also be used to reduce the side effects of gut
toxicity that result from radiation, chemotherapy treatments or viral infections. METH1
and/or METH2 polynucleotides or polypeptides, as well as agonists or antagonists of
METH1 and/or METH2, may have a cytoprotective effect on the small intestine mucosa.
METH1 and/or METH2 polynucleotides or polypeptides, as well as agonists or antagonists
of METH1 and/or METH2, may also stimulate healing of mucositis (mouth ulcers) that
result from chemotherapy and viral infections.

METH1 and/or METH2 polynucleotides or polypeptides, as well as agonists or
antagonists of METH1 and/or METH2, could further be used in full regeneration of skin in
full and partial thickness skin defects, including burns (i.e., repopulation of hair follicles,
sweat glands, and sebaceous glands), and treatment of other skin defects such as psoriasis.
METH1 and/or METH2 polynucleotides or polypeptides, as well as agonists or antagonists
of METH1 and/or METH2, could be used to treat epidermolysis bullosa, a defect in
adherence of the epidermis to the underlying dermis which results in frequent, open and
painful blisters, by accelerating reepithelialization of these lesions. METH1 and/or METH2
polynucleotides or polypeptides, as well as agonists or antagonists of METH1 and/or
METH2, could also be used to treat gastric and duodenal ulcers and help heal by scar
formation of the mucosal lining and regeneration of glandular mucosa and duodenal mucosal
lining more rapidly. Inflammatory bowel diseases, such as Crohn's disease and ulcerative
colitis, are diseases which result in destruction of the mucosal surface of the small or large
intestine, respectively. Thus, METH1 and/or METH2 polynucleotides or polypeptides, as
well as agonists or antagonists of METH1 and/or METH2, could be used to promote the
resurfacing of the mucosal surface to aid more rapid healing and to prevent progression of
inflammatory bowel disease. Treatment with METH1 and/or METH2 polynucleotides or
polypeptides, agonists or antagonists of METH1 and/or METH2, is expected to have a
significant effect on the production of mucus throughout the gastrointestinal tract and could
be used to protect the intestinal mucosa from injurious substances that are ingested or
following surgery. METH1 and/or METH2 polynucleotides or polypeptides, as well as
agonists or antagonists of METH1 and/or METH2, could be used to treat diseases associated
with the under expression of METH1 and/or METH2.

Moreover, METH1 and/or METH2 polynucleotides or polypeptides, as well as agonists
or antagonists of METH1 and/or METH2, could be used to prevent and heal damage to the
lungs due to various pathological states. A growth factor such as METH1 and/or METH2
polynucleotides or polypeptides, as well as agonists or antagonists of METH1 and/or
METH2, could stimulate proliferation and differentiation and promote the repair of alveoli
and bronchiolar epithelium to prevent or treat acute or chronic lung damage. For example,
emphysema, which results in the progressive loss of alveoli, and inhalation injuries, i.e.,
resulting from smoke inhalation and burns, that cause necrosis of the bronchiolar epithelium
and alveoli could be effectively treated using METH1 and/or METH2 polynucleotides or
polypeptides, agonists or antagonists of METH1 and/or METH2. Also, METH1 and/or
METH2 polynucleotides or polypeptides, as well as agonists or antagonists of METH1
and/or METH2, could be used to stimulate the proliferation and differentiation of type II
pneumocytes, which may help treat or prevent diseases such as hyaline membrane diseases,
such as infant respiratory distress syndrome and bronchopulmonary dysplasia, in premature
infants.

METH1 and/or METH2 polynucleotides or polypeptides, as well as agonists or
antagonists of METH1 and/or METH2, could stimulate the proliferation and differentiation
of hepatocytes and, thus, could be used to alleviate or treat liver diseases and pathologies
such as fulminant liver failure caused by cirrhosis, liver damage caused by viral hepatitis and
toxic substances (i.e., acetaminophen, carbon tetrachloride and other hepatotoxins known
in the art).

In addition, METH1 and/or METH2 polynucleotides or polypeptides, as well as
agonists or antagonists of METH1 and/or METH2, could be used to treat or prevent the onset
of diabetes mellitus. In patients with newly diagnosed Types I and II diabetes, where some
islet cell function remains, METH1 and/or METH2 polynucleotides or polypeptides, as well
as agonists or antagonists of METH1 and/or METH2, could be used to maintain the islet
function so as to alleviate, delay or prevent permanent manifestation of the disease. Also,
METH1 and/or METH2 polynucleotides or polypeptides, as well as agonists or antagonists
of METH1 and/or METH2, could be used as an auxiliary in islet cell transplantation to
improve or promote islet cell function.

Infectious Disease 

METH1 and/or METH2 polynucleotides or polypeptides, or agonists or antagonists
of METH1 and/or METH2, can be used to treat or detect infectious agents. For example,
by increasing the immune response, particularly increasing the proliferation and
differentiation of B and/or T cells, infectious diseases may be treated. The immune response
may be increased by either enhancing an existing immune response, or by initiating a new
immune response. Alternatively, METH1 and/or METH2 polynucleotides or polypeptides,
or agonists or antagonists of METH1 and/or METH2, may also directly inhibit the infectious
agent, without necessarily eliciting an immune response.

Viruses are one example of an infectious agent that can cause disease or symptoms
that can be treated or detected by METH1 and/or METH2 polynucleotides or polypeptides,
or agonists or antagonists of METH1 and/or METH2. Examples of viruses include, but are
not limited to the following DNA and RNA viral families: Arbovirus, Adenoviridae,
Arenaviridae, Arterivirus, Birnaviridae, Bunyaviridae, Caliciviridae, Circoviridae,
Coronaviridae, Flaviviridae, Hepadnaviridae (Hepatitis), Herpesviridae (such as
Cytomegalovirus, Herpes Simplex, Herpes Zoster), Mononegavirus (e.g., Paramyxoviridae,
Morbillivirus, Rhabdoviridae), Orthomyxoviridae (e.g., Influenza), Papovaviridae,
Parvoviridae, Picornaviridae, Poxviridae (such as Smallpox or Vaccinia), Reoviridae (e.g.,
Rotavirus), Retroviridae (HTLV-I, HTLV-II, Lentivirus), and Togaviridae (e.g., Rubivirus).
Viruses falling within these families can cause a variety of diseases or symptoms, including,
but not limited to: arthritis, bronchiolitis, encephalitis, eye infections (e.g., conjunctivitis,
keratitis), chronic fatigue syndrome, hepatitis (A, B, C, E, Chronic Active, Delta),
meningitis, opportunistic infections (e.g., AIDS), pneumonia, Burkitt's Lymphoma,
chickenpox, hemorrhagic fever, Measles, Mumps, Parainfluenza, Rabies, the common cold,
Polio, leukemia, Rubella, sexually transmitted diseases, skin diseases (e.g., Kaposi's, warts),
and viremia. METH1 and/or METH2 polynucleotides or polypeptides, or agonists or
antagonists of METH1 and/or METH2, can be used to treat or detect any of these symptoms
or diseases.

Similarly, bacterial or fungal agents that can cause disease or symptoms and that can
be treated or detected by METH1 and/or METH2 polynucleotides or polypeptides, or
agonists or antagonists of METH1 and/or METH2, include, but are not limited to, the following
Gram-Negative and Gram-positive bacterial families and fungi: Actinomycetales (e.g.,
Corynebacterium, Mycobacterium, Nocardia), Aspergillosis, Bacillaceae (e.g., Anthrax,
Clostridium), Bacteroidaceae, Blastomycosis, Bordetella, Borrelia, Brucellosis, Candidiasis,
Campylobacter, Coccidioidomycosis, Cryptococcosis, Dermatomycoses, Enterobacteriaceae
(Klebsiella, Salmonella, Serratia, Yersinia), Erysipelothrix, Helicobacter, Legionellosis,
Leptospirosis, Listeria, Mycoplasmatales, Neisseriaceae (e.g., Acinetobacter, Gonorrhea,
Meningococcal), Pasteurellaceae Infections (e.g., Actinobacillus, Haemophilus, Pasteurella),
Pseudomonas, Rickettsiaceae, Chlamydiaceae, Syphilis, and Staphylococcal. These
bacterial or fungal families can cause the following diseases or symptoms, including, but not
limited to: bacteremia, endocarditis, eye infections (conjunctivitis, tuberculosis, uveitis),
gingivitis, opportunistic infections (e.g., AIDS related infections), paronychia,
prosthesis-related infections, Reiter's Disease, respiratory tract infections, such as Whooping
Cough or Empyema, sepsis, Lyme Disease, Cat-Scratch Disease, Dysentery, Paratyphoid Fever,
food poisoning, Typhoid, pneumonia, Gonorrhea, meningitis, Chlamydia, Syphilis,
Diphtheria, Leprosy, Paratuberculosis, Tuberculosis, Lupus, Botulism, gangrene, tetanus,
impetigo, Rheumatic Fever, Scarlet Fever, sexually transmitted diseases, skin diseases (e.g.,
cellulitis, dermatomycoses), toxemia, urinary tract infections, and wound infections. METH1
and/or METH2 polynucleotides or polypeptides, or agonists or antagonists of METH1
and/or METH2, can be used to treat or detect any of these symptoms or diseases.

Moreover, parasitic agents causing disease or symptoms that can be treated or
detected by METH1 and/or METH2 polynucleotides or polypeptides, or agonists or
antagonists of METH1 and/or METH2, include, but are not limited to, the following
families: Amebiasis, Babesiosis, Coccidiosis, Cryptosporidiosis, Dientamoebiasis, Dourine,
Ectoparasitic, Giardiasis, Helminthiasis, Leishmaniasis, Theileriasis, Toxoplasmosis,
Trypanosomiasis, and Trichomonas. These parasites can cause a variety of diseases or
symptoms, including, but not limited to: Scabies, Trombiculiasis, eye infections, intestinal
disease (e.g., dysentery, giardiasis), liver disease, lung disease, opportunistic infections (e.g.,
AIDS related), Malaria, pregnancy complications, and toxoplasmosis. METH1 and/or
METH2 polynucleotides or polypeptides, or agonists or antagonists of METH1 and/or
METH2, can be used to treat or detect any of these symptoms or diseases.

Preferably, treatment using METH1 and/or METH2 polynucleotides or polypeptides,
or agonists or antagonists of METH1 and/or METH2, could either be by administering an
effective amount of METH1 and/or METH2 polypeptide to the patient, or by removing cells
from the patient, supplying the cells with METH1 and/or METH2 polynucleotide, and
returning the engineered cells to the patient (ex vivo therapy). Moreover, the METH1
and/or METH2 polypeptide or polynucleotide can be used as an antigen in a vaccine to raise
an immune response against infectious disease.

Regeneration

METH1 and/or METH2 polynucleotides or polypeptides, or agonists or antagonists
of METH1 and/or METH2, can be used to differentiate, proliferate, and attract cells, leading
to the regeneration of tissues. (See, Science 276:59-87 (1997).) The regeneration of tissues
could be used to repair, replace, or protect tissue damaged by congenital defects, trauma
(wounds, burns, incisions, or ulcers), age, disease (e.g., osteoporosis, osteoarthritis,
periodontal disease, liver failure), surgery, including cosmetic plastic surgery, fibrosis,
reperfusion injury, or systemic cytokine damage.

Tissues that could be regenerated using the present invention include organs (e.g.,
pancreas, liver, intestine, kidney, skin, endothelium), muscle (smooth, skeletal or cardiac),
vasculature (including vascular and lymphatics), nervous, hematopoietic, and skeletal (bone,
cartilage, tendon, and ligament) tissue. Preferably, regeneration occurs without scarring or
with decreased scarring. Regeneration also may include angiogenesis.

Moreover, METH1 and/or METH2 polynucleotides or polypeptides, or agonists or
antagonists of METH1 and/or METH2, may increase regeneration of tissues difficult to heal.
For example, increased tendon/ligament regeneration would quicken recovery time after
damage. METH1 and/or METH2 polynucleotides or polypeptides, or agonists or antagonists
of METH1 and/or METH2, of the present invention could also be used prophylactically in
an effort to avoid damage. Specific diseases that could be treated include tendinitis,
carpal tunnel syndrome, and other tendon or ligament defects. A further example of tissue
regeneration of non-healing wounds includes pressure ulcers, ulcers associated with vascular
insufficiency, surgical, and traumatic wounds.

Similarly, nerve and brain tissue could also be regenerated by using METH1 and/or
METH2 polynucleotides or polypeptides, or agonists or antagonists of METH1 and/or
METH2, to proliferate and differentiate nerve cells. Diseases that could be treated using this
method include central and peripheral nervous system diseases, neuropathies, or mechanical
and traumatic disorders (e.g., spinal cord disorders, head trauma, cerebrovascular disease,
and stroke). Specifically, diseases associated with peripheral nerve injuries, peripheral
neuropathy (e.g., resulting from chemotherapy or other medical therapies), localized
neuropathies, and central nervous system diseases (e.g., Alzheimer's disease, Parkinson's
disease, Huntington's disease, amyotrophic lateral sclerosis, and Shy-Drager syndrome),
could all be treated using the METH1 and/or METH2 polynucleotides or polypeptides, or
agonists or antagonists of METH1 and/or METH2.

Chemotaxis 

METH1 and/or METH2 polynucleotides or polypeptides, or agonists or antagonists
of METH1 and/or METH2, may have chemotaxis activity. A chemotaxic molecule attracts
or mobilizes cells (e.g., monocytes, fibroblasts, neutrophils, T-cells, mast cells, eosinophils,
epithelial and/or endothelial cells) to a particular site in the body, such as a site of
inflammation, infection, or hyperproliferation. The mobilized cells can then fight off and/or
heal the particular trauma or abnormality.

METH1 and/or METH2 polynucleotides or polypeptides, or agonists or antagonists
of METH1 and/or METH2, may increase chemotaxic activity of particular cells. These
chemotactic molecules can then be used to treat inflammation, infection, hyperproliferative
disorders, or any immune system disorder by increasing the number of cells targeted to a
particular location in the body. For example, chemotaxic molecules can be used to treat
wounds and other trauma to tissues by attracting immune cells to the injured location. As
a chemotactic molecule, METH1 and/or METH2 could also attract fibroblasts, which can
be used to treat wounds.

It is also contemplated that METH1 and/or METH2 polynucleotides or polypeptides,
or agonists or antagonists of METH1 and/or METH2, may inhibit chemotactic activity.
These molecules could also be used to treat disorders. Thus, METH1 and/or METH2
polynucleotides or polypeptides, or agonists or antagonists of METH1 and/or METH2, could
be used as an inhibitor of chemotaxis.

Binding Activity 

METH1 and/or METH2 polypeptides may be used to screen for molecules that bind
to METH1 and/or METH2 or for molecules to which METH1 and/or METH2 binds. The
binding of METH1 and/or METH2 and the molecule may activate (agonist), increase, inhibit
(antagonist), or decrease activity of the METH1 and/or METH2 or the molecule bound.
Examples of such molecules include antibodies, oligonucleotides, proteins (e.g., receptors),
or small molecules.

Preferably, the molecule is closely related to the natural ligand of METH1 and/or
METH2, e.g., a fragment of the ligand, or a natural substrate, a ligand, a structural or
functional mimetic. (See, Coligan et al., Current Protocols in Immunology 1(2): Chapter 5
(1991).) Similarly, the molecule can be closely related to the natural receptor to which
METH1 and/or METH2 binds, or at least, a fragment of the receptor capable of being bound
by METH1 and/or METH2 (e.g., active site). In either case, the molecule can be rationally
designed using known techniques.

Preferably, the screening for these molecules involves producing appropriate cells
which express METH1 and/or METH2, either as a secreted protein or on the cell membrane.
Preferred cells include cells from mammals, yeast, Drosophila, or E. coli. Cells expressing
METH1 and/or METH2 (or cell membrane containing the expressed polypeptide) are then
preferably contacted with a test compound potentially containing the molecule to observe
binding, stimulation, or inhibition of activity of either METH1 and/or METH2 or the
molecule.

The assay may simply test binding of a candidate compound to METH1 and/or
METH2, wherein binding is detected by a label, or in an assay involving competition with
a labeled competitor. Further, the assay may test whether the candidate compound results
in a signal generated by binding to METH1 and/or METH2.

Alternatively, the assay can be carried out using cell-free preparations,
polypeptide/molecule affixed to a solid support, chemical libraries, or natural product
mixtures. The assay may also simply comprise the steps of mixing a candidate compound
with a solution containing METH1 and/or METH2, measuring METH1 and/or
METH2/molecule activity or binding, and comparing the METH1 and/or METH2/molecule
activity or binding to a standard.

Preferably, an ELISA assay can measure METH1 and/or METH2 level or activity
in a sample (e.g., biological sample) using a monoclonal or polyclonal antibody. The
antibody can measure METH1 and/or METH2 level or activity by either binding, directly
or indirectly, to METH1 and/or METH2 or by competing with METH1 and/or METH2 for
a substrate.

Additionally, the receptor to which METH1 and/or METH2 binds can be identified
by numerous methods known to those of skill in the art, for example, ligand panning and
FACS sorting (Coligan, et al., Current Protocols in Immun., 1(2), Chapter 5, (1991)). For
example, expression cloning is employed wherein polyadenylated RNA is prepared from a
cell responsive to the polypeptides, for example, NIH3T3 cells which are known to contain
multiple receptors for the FGF family proteins, and SC-3 cells, and a cDNA library created
from this RNA is divided into pools and used to transfect COS cells or other cells that are
not responsive to the polypeptides. Transfected cells which are grown on glass slides are
exposed to the polypeptide of the present invention, after they have been labelled. The
polypeptides can be labeled by a variety of means including iodination or inclusion of a
recognition site for a site-specific protein kinase.

Following fixation and incubation, the slides are subjected to auto-radiographic
analysis. Positive pools are identified and sub-pools are prepared and re-transfected using
an iterative sub-pooling and re-screening process, eventually yielding a single clone that
encodes the putative receptor.
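
The iterative sub-pooling and re-screening logic described above can be sketched in code. This is only an illustration under stated assumptions, not the procedure actually used: the function names, the number of pools per round, and the `pool_is_positive` callback (which stands in for transfection of a pool followed by the autoradiographic readout) are all hypothetical.

```python
from typing import Callable, List, Sequence


def split_into_pools(clones: Sequence[str], n_pools: int) -> List[List[str]]:
    """Divide clones into at most n_pools non-empty pools of similar size."""
    n_pools = min(n_pools, len(clones))
    return [list(clones[i::n_pools]) for i in range(n_pools)]


def find_positive_clone(
    clones: Sequence[str],
    pool_is_positive: Callable[[List[str]], bool],  # hypothetical assay readout
    n_pools: int = 10,                               # assumed pools per round
) -> str:
    """Repeatedly sub-pool the positive pool until one clone remains."""
    if not clones:
        raise ValueError("library is empty")
    if n_pools < 2:
        raise ValueError("need at least two pools per round")
    current = list(clones)
    while len(current) > 1:
        pools = split_into_pools(current, n_pools)
        positives = [pool for pool in pools if pool_is_positive(pool)]
        if not positives:
            raise ValueError("no positive pool found in this round")
        # Follow the first positive pool; other positives could be
        # screened the same way in separate runs.
        current = positives[0]
    if not pool_is_positive(current):
        raise ValueError("final clone did not score positive")
    return current[0]


# Toy usage: one clone in a simulated library of 1000 scores positive.
library = [f"clone_{i}" for i in range(1000)]
target = "clone_437"
found = find_positive_clone(library, lambda pool: target in pool)
```

With ten pools per round, a library of 1000 clones is narrowed to a single clone in three rounds in this toy setting; real screens would also need to handle false positives and multiple true positives.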

As an alternative approach for receptor identification, the labeled polypeptides can
be photoaffinity linked with cell membrane or extract preparations that express the receptor
molecule. Cross-linked material is resolved by PAGE analysis and exposed to X-ray film.
The labeled complex containing the receptors of the polypeptides can be excised, resolved
into peptide fragments, and subjected to protein microsequencing. The amino acid sequence
obtained from microsequencing would be used to design a set of degenerate oligonucleotide
probes to screen a cDNA library to identify the genes encoding the putative receptors.
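
As an illustration of one consideration in designing degenerate probes from a microsequenced peptide, the sketch below counts how many distinct nucleotide sequences could encode a peptide under the standard genetic code and picks the least degenerate window. The text does not say how the probes would be chosen; the window-selection heuristic, the function names, and the example peptide are assumptions made for illustration.

```python
from math import prod
from typing import Tuple

# Number of codons per amino acid in the standard genetic code.
CODON_COUNT = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def degeneracy(peptide: str) -> int:
    """Number of distinct coding sequences for the given peptide."""
    return prod(CODON_COUNT[aa] for aa in peptide.upper())


def least_degenerate_window(peptide: str, length: int) -> Tuple[int, str, int]:
    """Return (start, window, degeneracy) of the least degenerate window.

    Ties are resolved in favor of the earliest window.
    """
    if length < 1 or length > len(peptide):
        raise ValueError("window length must be between 1 and len(peptide)")
    best = None
    for start in range(len(peptide) - length + 1):
        window = peptide[start:start + length]
        d = degeneracy(window)
        if best is None or d < best[2]:
            best = (start, window, d)
    return best


# Hypothetical peptide fragment, used only to exercise the functions.
example = least_degenerate_window("MWLSRM", 2)
```

Windows rich in Met and Trp (one codon each) give the smallest probe pools, whereas Leu, Ser and Arg (six codons each) inflate them quickly.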

Moreover, the techniques of gene-shuffling, motif-shuffling, exon-shuffling, and/or
codon-shuffling (collectively referred to as "DNA shuffling") may be employed to modulate
the activities of METH1 or METH2 thereby effectively generating agonists and antagonists
of METH1 or METH2. See generally, U.S. Patent Nos. 5,605,793, 5,811,238, 5,830,721,
5,834,252, and 5,837,458; and Patten, P. A. et al., Curr. Opinion Biotechnol. 8:724-733
(1997); Harayama, S. Trends Biotechnol. 16(2):76-82 (1998); Hansson, L.O. et al., J. Mol.
Biol. 287:265-276 (1999); and Lorenzo, M.M. and Blasco, R. Biotechniques 24(2):308-313
(1998) (each of these patents and publications are hereby incorporated by reference). In one
embodiment, alteration of METH1 or METH2 polynucleotides and corresponding
polypeptides may be achieved by DNA shuffling. DNA shuffling involves the assembly of
two or more DNA segments into a desired METH1 or METH2 molecule by homologous,
or site-specific, recombination.

In another embodiment, METH1 or METH2 polynucleotides and corresponding
polypeptides may be altered by being subjected to random mutagenesis by error-prone PCR,
random nucleotide insertion, or other methods prior to recombination. In another
embodiment, one or more components, motifs, sections, parts, domains, fragments, etc., of
METH1 or METH2 may be recombined with one or more components, motifs, sections,
parts, domains, fragments, etc. of one or more heterologous molecules. In preferred
embodiments, the heterologous molecule is a growth factor such as, for example,
platelet-derived growth factor (PDGF), insulin-like growth factor (IGF-I), transforming growth
factor (TGF)-alpha, epidermal growth factor (EGF), fibroblast growth factor (FGF), TGF-beta,
bone morphogenetic protein (BMP)-2, BMP-4, BMP-5, BMP-6, BMP-7, activins
A and B, decapentaplegic (dpp), 60A, OP-2, dorsalin, growth differentiation factors (GDFs),
nodal, MIS, inhibin-alpha, TGF-beta1, TGF-beta2, TGF-beta3, TGF-beta5, and glial-derived
neurotrophic factor (GDNF).

Other preferred fragments are biologically active METH1 or METH2 fragments.
Biologically active fragments are those exhibiting activity similar, but not necessarily
identical, to an activity of the METH1 or METH2 polypeptide. The biological activity of
the fragments may include an improved desired activity, or a decreased undesirable activity.

Additionally, this invention provides a method of screening compounds to identify
those which modulate the action of the polypeptide of the present invention. An example
of such an assay comprises combining a mammalian fibroblast cell, the polypeptide of the
present invention, the compound to be screened and 3[H] thymidine under cell culture
conditions where the fibroblast cell would normally proliferate. A control assay may be
performed in the absence of the compound to be screened and compared to the amount of
fibroblast proliferation in the presence of the compound to determine if the compound
stimulates proliferation by determining the uptake of 3[H] thymidine in each case. The
amount of fibroblast cell proliferation is measured by liquid scintillation chromatography
which measures the incorporation of 3[H] thymidine. Both agonist and antagonist
compounds may be identified by this procedure.
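
The comparison between the control assay and the assay run in the presence of the compound can be expressed as a simple ratio of mean thymidine incorporation. The sketch below is one hedged way to do this; the 1.5-fold threshold, the function name, the category labels, and the counts-per-minute values are illustrative assumptions and do not come from the text.

```python
from statistics import mean
from typing import Sequence, Tuple


def classify_compound(
    control_cpm: Sequence[float],
    treated_cpm: Sequence[float],
    fold_threshold: float = 1.5,  # assumed cutoff, chosen only for illustration
) -> Tuple[str, float]:
    """Compare mean incorporation (counts per minute) with and without compound."""
    if fold_threshold <= 1:
        raise ValueError("fold_threshold must be greater than 1")
    control_mean = mean(control_cpm)
    if control_mean <= 0:
        raise ValueError("control incorporation must be positive")
    ratio = mean(treated_cpm) / control_mean
    if ratio >= fold_threshold:
        return "stimulates proliferation", ratio
    if ratio <= 1 / fold_threshold:
        return "inhibits proliferation", ratio
    return "no clear effect", ratio


# Made-up replicate counts, used only to exercise the function.
label, ratio = classify_compound([1000, 1100, 900], [2000, 2100, 1900])
```

In practice a statistical test over replicates would be preferable to a fixed fold cutoff; the ratio is shown here only because it mirrors the control-versus-compound comparison described above.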

In another method, a mammalian cell or membrane preparation expressing a receptor
for a polypeptide of the present invention is incubated with a labeled polypeptide of the
present invention in the presence of the compound. The ability of the compound to enhance
or block this interaction could then be measured. Alternatively, the response of a known
second messenger system following interaction of a compound to be screened and the
METH1 and/or METH2 receptor is measured and the ability of the compound to bind to the
receptor and elicit a second messenger response is measured to determine if the compound
is a potential agonist or antagonist. Such second messenger systems include, but are not
limited to, cAMP, guanylate cyclase, ion channels or phosphoinositide hydrolysis.

All of the above assays can be used as diagnostic or prognostic markers. The
molecules discovered using these assays can be used to treat disease or to bring about a
particular result in a patient (e.g., blood vessel growth) by activating or inhibiting the
METH1 and/or METH2 molecule. Moreover, the assays can discover agents which may
inhibit or enhance the production of METH1 and/or METH2 from suitably manipulated cells
or tissues. Therefore, the invention includes a method of identifying compounds which
bind to METH1 and/or METH2 comprising the steps of: (a) incubating a candidate binding
compound with METH1 and/or METH2; and (b) determining if binding has occurred.
Moreover, the invention includes a method of identifying agonists/antagonists comprising
the steps of: (a) incubating a candidate compound with METH1 and/or METH2; (b)
assaying a biological activity; and (c) determining if a biological activity of METH1 and/or
METH2 has been altered.

Also, one could identify molecules which bind METH1 and/or METH2
experimentally by using the beta-pleated sheet regions disclosed in Figures 10 and 11 and
Tables 1 and 2. Accordingly, specific embodiments of the invention are directed to
polynucleotides encoding polypeptides which comprise, or alternatively consist of, the
amino acid sequence of each of the beta-pleated sheet regions disclosed in Figure 10/Table 1
and Figure 11/Table 2. Additional embodiments of the invention are directed to polynucleotides
encoding METH1 and/or METH2 polypeptides which comprise, or alternatively consist of,
any combination or all of the beta-pleated sheet regions disclosed in Figure 10/Table 1 and
Figure 11/Table 2. Additional preferred embodiments of the invention are directed to
polypeptides which comprise, or alternatively consist of, the METH1 and/or METH2 amino
acid sequence of each of the beta-pleated sheet regions disclosed in Figure 10/Table 1 and
Figure 11/Table 2. Additional embodiments of the invention are directed to METH1 and/or
METH2 polypeptides which comprise, or alternatively consist of, any combination or all of
the beta-pleated sheet regions disclosed in Figure 10/Table 1 and Figure 11/Table 2.

Antisense And Ribozyme (Antagonists)

In specific embodiments, antagonists according to the present invention are nucleic
acids corresponding to the sequences contained in SEQ ID NO: 1 or 3, or the complementary
strand thereof, and/or to nucleotide sequences contained in the deposited clones. In one
embodiment, the antisense sequence is generated internally by the organism; in another
embodiment, the antisense sequence is separately administered (see, for example, O'Connor,
J., Neurochem. 56:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene
Expression, CRC Press, Boca Raton, FL (1988)). Antisense technology can be used to
control gene expression through antisense DNA or RNA, or through triple-helix formation.
Antisense techniques are discussed, for example, in Okano, J., Neurochem. 56:560 (1991);
Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca Raton,
FL (1988). Triple helix formation is discussed in, for instance, Lee et al., Nucleic Acids
Research 6:3073 (1979); Cooney et al., Science 241:456 (1988); and Dervan et al., Science
257:1300 (1991). The methods are based on binding of a polynucleotide to a
complementary DNA or RNA.

For example, the 5' coding portion of a polynucleotide that encodes the mature 
polypeptide of the present invention may be used to design an antisense RNA 
oligonucleotide of from about 10 to 40 base pairs in length. A DNA oligonucleotide is 
designed to be complementary to a region of the gene involved in transcription thereby 
preventing transcription and the production of the receptor. The antisense RNA 
oligonucleotide hybridizes to the mRNA in vivo and blocks translation of the mRNA 
molecule into receptor polypeptide. 
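
The design step described above amounts to taking a stretch of the mRNA and writing out its reverse complement. A minimal Python sketch follows; the helper name `antisense_rna` and the toy input sequence are assumptions made for illustration, and the sequence shown is not a METH1 or METH2 sequence.

```python
# Hypothetical helper: builds the antisense RNA for a stretch of mRNA.
_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense_rna(mrna: str, start: int, length: int) -> str:
    """Return the antisense RNA (written 5'->3') for mrna[start:start+length]."""
    if not 10 <= length <= 40:
        raise ValueError("length should be about 10 to 40 bases")
    target = mrna.upper()[start:start + length]
    if len(target) != length or set(target) - set("ACGU"):
        raise ValueError("target must be a full-length RNA stretch (A, C, G, U)")
    # Complement each base, then reverse so the result reads 5'->3'.
    return target.translate(_COMPLEMENT)[::-1]


# Toy sequence, for illustration only.
print(antisense_rna("AUGGCCAUUGUAAUGGGCCGC", 0, 10))  # prints CAAUGGCCAU
```

A DNA oligonucleotide would use T in place of U in the output; the sketch keeps RNA throughout for simplicity.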

In one embodiment, the METH1 and/or METH2 antisense nucleic acid of the
invention is produced intracellularly by transcription from an exogenous sequence. For
example, a vector, or a portion thereof, is transcribed, producing an antisense nucleic acid
(RNA) of the invention. Such a vector would contain a sequence encoding the METH1
and/or METH2 antisense nucleic acid. Such a vector can remain episomal or become
chromosomally integrated, as long as it can be transcribed to produce the desired antisense
RNA. Such vectors can be constructed by recombinant DNA technology methods standard
in the art. Vectors can be plasmid, viral, or others known in the art, used for replication and
expression in vertebrate cells. Expression of the sequence encoding METH1 and/or
METH2, or fragments thereof, can be by any promoter known in the art to act in vertebrate,
preferably human, cells. Such promoters can be inducible or constitutive. Such promoters
include, but are not limited to, the SV40 early promoter region (Bernoist and Chambon,
Nature 29:304-310 (1981)), the promoter contained in the 3' long terminal repeat of Rous
sarcoma virus (Yamamoto et al., Cell 22:787-797 (1980)), the herpes thymidine promoter
(Wagner et al., Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445 (1981)), the regulatory sequences
of the metallothionein gene (Brinster et al., Nature 296:39-42 (1982)), etc.

The antisense nucleic acids of the invention comprise a sequence complementary to
at least a portion of an RNA transcript of a METH1 and/or METH2 gene. However,
absolute complementarity, although preferred, is not required. A sequence "complementary
to at least a portion of an RNA," as referred to herein, means a sequence having sufficient
complementarity to be able to hybridize with the RNA, forming a stable duplex; in the case
of double-stranded METH1 and/or METH2 antisense nucleic acids, a single strand of the
duplex DNA may thus be tested, or triplex formation may be assayed. The ability to
hybridize will depend on both the degree of complementarity and the length of the antisense
nucleic acid. Generally, the larger the hybridizing nucleic acid, the more base mismatches
with a METH1 and/or METH2 RNA it may contain and still form a stable duplex (or triplex,
as the case may be). One skilled in the art can ascertain a tolerable degree of mismatch by
use of standard procedures to determine the melting point of the hybridized complex.
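
The text does not say which procedure is used to determine the melting point. As one illustration only, a common rule of thumb for short oligonucleotides is the Wallace rule, which estimates the melting temperature as 2 °C for each A or T and 4 °C for each G or C. The Python sketch below applies that rule; the function name `wallace_tm` is hypothetical, and the estimate is rough and becomes less reliable for longer sequences or mismatched duplexes.

```python
def wallace_tm(oligo: str) -> int:
    """Rough melting temperature (deg C) by the Wallace rule: 2*(A+T) + 4*(G+C).

    U is counted like T so that RNA sequences can be passed as well.
    """
    seq = oligo.upper().replace("U", "T")
    if not seq or set(seq) - set("ACGT"):
        raise ValueError("oligo must contain only A, C, G, T or U")
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


# Toy 12-mer, for illustration only.
print(wallace_tm("ATGCATGCATGC"))  # prints 36
```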

Oligonucleotides that are complementary to the 5' end of the message, e.g., the 5'
untranslated sequence up to and including the AUG initiation codon, should work most
efficiently at inhibiting translation. However, sequences complementary to the 3'
untranslated sequences of mRNAs have been shown to be effective at inhibiting translation
of mRNAs as well. See generally, Wagner, R., 1994, Nature 372:333-335. Thus,
oligonucleotides complementary to either the 5'- or 3'-non-translated, non-coding regions
of METH1 and/or METH2 shown in Figure 1 could be used in an antisense approach to
inhibit translation of endogenous METH1 and/or METH2 mRNA. Oligonucleotides
complementary to the 5' untranslated region of the mRNA should include the complement
of the AUG start codon. Antisense oligonucleotides complementary to mRNA coding
regions are less efficient inhibitors of translation but could be used in accordance with the
invention. Whether designed to hybridize to the 5'-, 3'- or coding region of METH1 and/or
METH2 mRNA, antisense nucleic acids should be at least six nucleotides in length, and are
preferably oligonucleotides ranging from 6 to about 50 nucleotides in length. In specific
aspects the oligonucleotide is at least 10 nucleotides, at least 17 nucleotides, at least 25
nucleotides or at least 50 nucleotides.
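
Choosing a target window that ends at the AUG start codon can be sketched as follows. The function name `start_codon_antisense` is hypothetical, and the sketch assumes that the first AUG found in the sequence is the initiation codon, which need not hold for a real transcript. The input is a toy sequence, not a METH1 or METH2 sequence.

```python
_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def start_codon_antisense(mrna: str, length: int = 20) -> str:
    """Antisense RNA (5'->3') against the 5' UTR up to and including the AUG."""
    if not 6 <= length <= 50:
        raise ValueError("length should be from 6 to about 50 nucleotides")
    seq = mrna.upper()
    aug = seq.find("AUG")  # assumption: the first AUG is the initiation codon
    if aug < 0:
        raise ValueError("no AUG found in the sequence")
    end = aug + 3  # include the start codon itself
    target = seq[max(0, end - length):end]
    if len(target) < 6:
        raise ValueError("too little 5' sequence before the AUG")
    # Reverse complement, so the oligo ends up covering the AUG.
    return target.translate(_COMPLEMENT)[::-1]


print(start_codon_antisense("GGCACGAGCCUAUGGCU", length=10))  # prints CAUAGGCUCG
```

The result begins with CAU, the complement of the AUG codon, which confirms that the start codon is covered.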

The polynucleotides of the invention can be DNA or RNA or chimeric mixtures or
derivatives or modified versions thereof, single-stranded or double-stranded. The
oligonucleotide can be modified at the base moiety, sugar moiety, or phosphate backbone,
for example, to improve stability of the molecule, hybridization, etc. The oligonucleotide
may include other appended groups such as peptides (e.g., for targeting host cell receptors
in vivo), or agents facilitating transport across the cell membrane (see, e.g., Letsinger et al.,
1989, Proc. Natl. Acad. Sci. U.S.A. 86:6553-6556; Lemaitre et al., 1987, Proc. Natl. Acad.
Sci. 84:648-652; PCT Publication No. WO88/09810, published December 15, 1988) or the
blood-brain barrier (see, e.g., PCT Publication No. WO89/10134, published April 25, 1988),
hybridization-triggered cleavage agents (see, e.g., Krol et al., 1988, BioTechniques 6:958-976)
or intercalating agents (see, e.g., Zon, 1988, Pharm. Res. 5:539-549). To this end,
the oligonucleotide may be conjugated to another molecule, e.g., a peptide, hybridization
triggered cross-linking agent, transport agent, hybridization-triggered cleavage agent, etc.

The antisense oligonucleotide may comprise at least one modified base moiety which
is selected from the group including, but not limited to, 5-fluorouracil, 5-bromouracil,
5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine,
5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine,
5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine,
N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine,
2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine,
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil,
beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil,
2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil,
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil,
uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil,
3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine.

The antisense oligonucleotide may also comprise at least one modified sugar moiety 
selected from the group including, but not limited to, arabinose, 2-fluoroarabinose, xylulose, 
and hexose. 

In yet another embodiment, the antisense oligonucleotide comprises at least one
modified phosphate backbone selected from the group including, but not limited to, a 
phosphorothioate, a phosphorodithioate, a phosphoramidothioate, a phosphoramidate, a 
phosphordiamidate, a methylphosphonate, an alkyl phosphotriester, and a formacetal or 
analog thereof. 

In yet another embodiment, the antisense oligonucleotide is an α-anomeric
oligonucleotide. An α-anomeric oligonucleotide forms specific double-stranded hybrids
with complementary RNA in which, contrary to the usual β-units, the strands run parallel
to each other (Gautier et al., 1987, Nucl. Acids Res. 15:6625-6641). The oligonucleotide is
a 2'-O-methylribonucleotide (Inoue et al., 1987, Nucl. Acids Res. 15:6131-6148), or a
chimeric RNA-DNA analogue (Inoue et al., 1987, FEBS Lett. 215:327-330).

Polynucleotides of the invention may be synthesized by standard methods known in
the art, e.g., by use of an automated DNA synthesizer (such as are commercially available
from Biosearch, Applied Biosystems, etc.). As examples, phosphorothioate oligonucleotides
may be synthesized by the method of Stein et al. (1988, Nucl. Acids Res. 16:3209),
methylphosphonate oligonucleotides can be prepared by use of controlled pore glass
polymer supports (Sarin et al., 1988, Proc. Natl. Acad. Sci. U.S.A. 85:7448-7451), etc.

While antisense nucleotides complementary to the METH1 and/or METH2 coding
region sequence could be used, those complementary to the transcribed untranslated region
are most preferred.

Potential antagonists according to the invention also include catalytic RNA, or a
ribozyme (see, e.g., PCT International Publication WO 90/11364, published October 4,
1990; Sarver et al., Science 247:1222-1225 (1990)). While ribozymes that cleave mRNA at
site-specific recognition sequences can be used to destroy METH1 and/or METH2 mRNAs,
the use of hammerhead ribozymes is preferred. Hammerhead ribozymes cleave mRNAs at
locations dictated by flanking regions that form complementary base pairs with the target
mRNA. The sole requirement is that the target mRNA have the following sequence of two
bases: 5'-UG-3'. The construction and production of hammerhead ribozymes is well known
in the art and is described more fully in Haseloff and Gerlach, Nature 334:585-591 (1988).
There are numerous potential hammerhead ribozyme cleavage sites within the nucleotide
sequence of METH1 and/or METH2 (Figures 1 and 2). Preferably, the ribozyme is
engineered so that the cleavage recognition site is located near the 5' end of the METH1
and/or METH2 mRNA, i.e., to increase efficiency and minimize the intracellular
accumulation of non-functional mRNA transcripts.
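
Locating candidate cleavage sites of the kind described above is a simple motif scan. The Python sketch below lists every position of the two-base motif stated in the text (5'-UG-3'), ordered from the 5' end so that the sites nearest the 5' end come first. The function name `candidate_cleavage_sites` and the toy sequence are assumptions for illustration; the motif is a parameter because other recognition sequences may be preferred in practice.

```python
def candidate_cleavage_sites(mrna: str, motif: str = "UG") -> list:
    """Return 0-based positions of `motif` in `mrna`, nearest the 5' end first."""
    seq = mrna.upper()
    motif = motif.upper()
    if not motif:
        raise ValueError("motif must not be empty")
    width = len(motif)
    # A left-to-right scan yields positions already sorted from the 5' end.
    return [i for i in range(len(seq) - width + 1) if seq[i:i + width] == motif]


# Toy sequence, for illustration only.
print(candidate_cleavage_sites("AUGGCUGAUG"))  # prints [1, 5, 8]
```

The first entries of the returned list correspond to the sites closest to the 5' end of the message, which the text identifies as the preferred location.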

As in the antisense approach, the ribozymes of the invention can be composed of
modified oligonucleotides (e.g., for improved stability, targeting, etc.) and should be
delivered to cells which express METH1 and/or METH2 in vivo. DNA constructs encoding
the ribozyme may be introduced into the cell in the same manner as described above for the
introduction of antisense-encoding DNA. A preferred method of delivery involves using a
DNA construct "encoding" the ribozyme under the control of a strong constitutive promoter,
such as, for example, a pol III or pol II promoter, so that transfected cells will produce
sufficient quantities of the ribozyme to destroy endogenous METH1 and/or METH2
messages and inhibit translation. Since ribozymes, unlike antisense molecules, are catalytic,
a lower intracellular concentration is required for efficiency.

Antagonist/agonist compounds may be employed to inhibit the cell growth and
proliferation effects of the polypeptides of the present invention on neoplastic cells and
tissues, i.e., stimulation of angiogenesis of tumors, and, therefore, retard or prevent abnormal
cellular growth and proliferation, for example, in tumor formation or growth.

The antagonist/agonist may also be employed to prevent hypervascular diseases, and
prevent the proliferation of epithelial lens cells after extracapsular cataract surgery.
Prevention of the mitogenic activity of the polypeptides of the present invention may also
be desirable in cases such as restenosis after balloon angioplasty.

The antagonist/agonist may also be employed to prevent the growth of scar tissue
during wound healing.

The antagonist/agonist may also be employed to treat the diseases described herein.

Other Activities 

As stated below, METH1 and METH2 share structural and sequence homology with
members of the ADAM family. ADAM proteins have been shown to proteolytically process
membrane-anchored proteins, including TNF (Black et al., Nature 385:129 (1997); Moss
et al., Nature 385:733 (1997)). Thus, METH1 and/or METH2 may be useful in proteolytic
processing of membrane-anchored proteins. Membrane-anchored proteins which may be
proteolytically processed by METH1 and/or METH2 include cytokines, growth factors,
cytokine receptors and growth factor receptors.

METH1 or METH2 polypeptides or polynucleotides, or agonists or antagonists of
METH1 and/or METH2, may also increase or decrease the differentiation or proliferation
of embryonic stem cells, besides, as discussed above, hematopoietic lineage.

METH1 or METH2 polypeptides or polynucleotides, or agonists or antagonists of
METH1 and/or METH2, may also be used to modulate mammalian characteristics, such as
body height, weight, hair color, eye color, skin, percentage of adipose tissue, pigmentation,
size, and shape (e.g., cosmetic surgery). Similarly, METH1 or METH2 polypeptides or
polynucleotides, or agonists or antagonists of METH1 and/or METH2, may be used to
modulate mammalian metabolism affecting catabolism, anabolism, processing, utilization,
and storage of energy.

As angiogenesis is a key factor in supporting adipose tissue, METH1 or METH2
polypeptides or polynucleotides, or agonists or antagonists of METH1 and/or METH2, may
be used to control weight, reduce weight, treat obesity, and/or control adipose tissue in an
individual.

METH1 or METH2 polypeptides or polynucleotides, or agonists or antagonists of
METH1 and/or METH2, may be used to change a mammal's mental state or physical state
by influencing biorhythms, circadian rhythms, depression (including depressive disorders),
tendency for violence, tolerance for pain, reproductive capabilities (preferably by Activin
or Inhibin-like activity), hormonal or endocrine levels, appetite, libido, memory, stress, or
other cognitive qualities.

METH1 or METH2 polypeptides or polynucleotides, or agonists or antagonists of
METH1 and/or METH2, may also be used as a food additive or preservative, such as to
increase or decrease storage capabilities, fat content, lipid, protein, carbohydrate, vitamins,
minerals, cofactors or other nutritional components.

The above-recited applications have uses in a wide variety of hosts. Such hosts 
include, but are not limited to, human, murine, rabbit, goat, guinea pig, camel, horse, mouse, 
rat, hamster, pig, micro-pig, chicken, goat, cow, sheep, dog, cat, non-human primate, and 
human. In specific embodiments, the host is a mouse, rabbit, goat, guinea pig, chicken, rat, 
hamster, pig, sheep, dog or cat. In preferred embodiments, the host is a mammal. In most 
preferred embodiments, the host is a human. 

Anti-angiogenesis

As shown in Examples 4 and 5, METH1 and METH2 inhibit angiogenesis. Thus,
the present invention provides a method of treating an angiogenesis-related disease and/or
disorder, comprising administering to an individual in need thereof a therapeutically
effective amount of a METH1 and/or METH2 polynucleotide, polypeptide, and/or agonist.

For example, METH1 and/or METH2 polynucleotides, polypeptides, and/or agonists
may be utilized in a variety of additional methods in order to therapeutically treat a cancer
or tumor. Cancers which may be treated with METH1 and/or METH2 polynucleotides,
polypeptides and/or agonists include, but are not limited to, solid tumors, including prostate,
lung, breast, ovarian, stomach, pancreas, larynx, esophagus, testes, liver, parotid, biliary
tract, colon, rectum, cervix, uterus, endometrium, kidney, bladder, thyroid cancer; primary
tumors and metastases; melanomas; glioblastoma; Kaposi's sarcoma; leiomyosarcoma;
non-small cell lung cancer; colorectal cancer; advanced malignancies; and blood-borne tumors
such as leukemias. For example, METH1 and/or METH2 polynucleotides, polypeptides,
and/or agonists may be delivered topically, in order to treat cancers such as skin cancer, head
and neck tumors, breast tumors, and Kaposi's sarcoma. Within yet other aspects, METH1
and/or METH2 polynucleotides, polypeptides, and/or agonists may be utilized to treat
superficial forms of bladder cancer by, for example, intravesical administration. METH1
and/or METH2 polynucleotides, polypeptides and/or agonists may be delivered directly into
the tumor, or near the tumor site, via injection or a catheter. Of course, as the artisan of
ordinary skill will appreciate, the appropriate mode of administration will vary according
to the cancer to be treated. Other modes of delivery are discussed herein.

METH1 and/or METH2 polynucleotides, polypeptides and/or agonists may be useful
in treating other disorders, besides cancers, which involve angiogenesis. These disorders
include, but are not limited to: benign tumors, for example hemangiomas, acoustic
neuromas, neurofibromas, trachomas, and pyogenic granulomas; atherosclerotic plaques;
ocular angiogenic diseases, for example, diabetic retinopathy, retinopathy of prematurity,
macular degeneration, corneal graft rejection, neovascular glaucoma, retrolental fibroplasia,
rubeosis, retinoblastoma, uveitis and pterygia (abnormal blood vessel growth) of the eye;
rheumatoid arthritis; psoriasis; delayed wound healing; endometriosis; vasculogenesis;
granulations; hypertrophic scars (keloids); nonunion fractures; scleroderma; trachoma;
vascular adhesions; myocardial angiogenesis; coronary collaterals; cerebral collaterals;
arteriovenous malformations; ischemic limb angiogenesis; Osler-Weber syndrome; plaque
neovascularization; telangiectasia; hemophiliac joints; angiofibroma; fibromuscular
dysplasia; wound granulation; Crohn's disease; and atherosclerosis.

For example, within one aspect of the present invention methods are provided for
treating hypertrophic scars and keloids, comprising the step of administering a METH1
and/or METH2 polynucleotide, polypeptide, and/or agonist to a hypertrophic scar or keloid.
Within one embodiment of the present invention METH1 and/or METH2 polynucleotides,
polypeptides, and/or agonists are directly injected into a hypertrophic scar or keloid, in order
to prevent the progression of these lesions. This therapy is of particular value in the
prophylactic treatment of conditions which are known to result in the development of
hypertrophic scars and keloids (e.g., burns), and is preferably initiated after the proliferative
phase has had time to progress (approximately 14 days after the initial injury), but before
hypertrophic scar or keloid development.

As noted above, the present invention also provides methods for treating neovascular
diseases of the eye, including, for example, corneal neovascularization, neovascular
glaucoma, proliferative diabetic retinopathy, retrolental fibroplasia and macular
degeneration, as well as other eye inflammatory diseases, ocular tumors and diseases
associated with choroidal or iris neovascularization. See, e.g., reviews by Waltman et al.,
Am. J. Ophthal. 85:704-710 (1978) and Gartner et al., Surv. Ophthal. 22:291-312 (1978).

Thus, within one aspect of the present invention methods are provided for treating
neovascular diseases of the eye such as corneal neovascularization (including corneal graft
neovascularization), comprising the step of administering to a patient a therapeutically
effective amount of a METH1 and/or METH2 compound (as described above) to the cornea,
such that the formation of blood vessels is inhibited. Briefly, the cornea is a tissue which
normally lacks blood vessels. In certain pathological conditions, however, capillaries may
extend into the cornea from the pericorneal vascular plexus of the limbus. When the cornea
becomes vascularized, it also becomes clouded, resulting in a decline in the patient's visual
acuity. Visual loss may become complete if the cornea completely opacitates.

A wide variety of disorders can result in corneal neovascularization, including, for
example, corneal infections (e.g., trachoma, herpes simplex keratitis, leishmaniasis and
onchocerciasis), immunological processes (e.g., graft rejection and Stevens-Johnson's
syndrome), alkali burns, trauma, inflammation (of any cause), toxic and nutritional
deficiency states, and as a complication of wearing contact lenses.

Within particularly preferred embodiments of the invention, METH1 and/or METH2
may be prepared for topical administration in saline (combined with any of the preservatives
and antimicrobial agents commonly used in ocular preparations), and administered in
eyedrop form. The solution or suspension may be prepared in its pure form and administered
several times daily. Alternatively, anti-angiogenic compositions, prepared as described
above, may also be administered directly to the cornea. Within preferred embodiments, the
anti-angiogenic composition is prepared with a muco-adhesive polymer which binds to
the cornea. Within further embodiments, the anti-angiogenic factors or anti-angiogenic
compositions may be utilized as an adjunct to conventional steroid therapy.

Topical therapy may also be useful prophylactically in corneal lesions which are 
known to have a high probability of inducing an angiogenic response (such as chemical 
burns). In these instances the treatment, likely in combination with steroids, may be 
instituted immediately to help prevent subsequent complications. 

Within other embodiments, the compounds described above may be injected directly 
into the corneal stroma by an ophthalmologist under microscopic guidance. The preferred 
site of injection may vary with the morphology of the individual lesion, but the goal of the 
administration would be to place the composition at the advancing front of the vasculature 
(i.e., interspersed between the blood vessels and the normal cornea). In most cases this 
would involve perilimbic corneal injection to "protect" the cornea from the advancing blood 
vessels. This method may also be utilized shortly after a corneal insult in order to 
prophylactically prevent corneal neovascularization. In this situation the material could be 
injected in the perilimbic cornea interspersed between the corneal lesion and its undesired 
potential limbic blood supply. Such methods may also be utilized in a similar fashion to 
prevent capillary invasion of transplanted corneas. In a sustained-release form injections 
might only be required 2-3 times per year. A steroid could also be added to the injection 
solution to reduce inflammation resulting from the injection itself.

Within another aspect of the present invention, methods are provided for treating
neovascular glaucoma, comprising the step of administering to a patient a therapeutically
effective amount of a METH1 and/or METH2 polypeptide, polynucleotide, and/or agonist
to the eye, such that the formation of blood vessels is inhibited. In one embodiment, the
compound may be administered topically to the eye in order to treat early forms of
neovascular glaucoma. Within other embodiments, the compound may be implanted by
injection into the region of the anterior chamber angle. Within other embodiments, the
compound may also be placed in any location such that the compound is continuously
released into the aqueous humor.

Within another aspect of the present invention, methods are provided for treating
proliferative diabetic retinopathy, comprising the step of administering to a patient a
therapeutically effective amount of a METH1 and/or METH2 polynucleotide, polypeptide,
and/or agonist to the eyes, such that the formation of blood vessels is inhibited. Within
particularly preferred embodiments of the invention, proliferative diabetic retinopathy may
be treated by injection into the aqueous humor or the vitreous, in order to increase the local
concentration of the METH1 and/or METH2 polynucleotide, polypeptide, and/or agonist in
the retina. Preferably, this treatment should be initiated prior to the acquisition of severe
disease requiring photocoagulation.

Within another aspect of the present invention, methods are provided for treating
retrolental fibroplasia, comprising the step of administering to a patient a therapeutically
effective amount of a METH1 and/or METH2 polynucleotide, polypeptide, and/or agonist
to the eye, such that the formation of blood vessels is inhibited. The compound may be
administered topically, via intravitreous injection and/or via intraocular implants.

METH1 and/or METH2 polynucleotides, polypeptides and/or agonists may be used
to treat diseases that have angiogenesis as a pathologic consequence, such as cat scratch
disease (Rochalimaea quintana), ulcers (Helicobacter pylori), Bartonellosis and bacillary
angiomatosis.

METH1 and/or METH2 polynucleotides, polypeptides and/or agonists may be used
as a birth control agent by preventing vascularization required for embryo implantation. In
one aspect of the birth control method, an amount of the compound sufficient to block
embryo implantation is administered before or after intercourse and fertilization have
occurred, thus providing an effective method of birth control, possibly a "morning after"
method.

METH1 and/or METH2 polynucleotides, polypeptides and/or agonists may also be
used in controlling menstruation or administered as either a peritoneal lavage fluid or for
peritoneal implantation in the treatment of endometriosis.

METH1 and/or METH2 polynucleotides, polypeptides, and/or agonists of the present
invention may be incorporated into surgical sutures in order to prevent stitch granulomas.

METH1 and/or METH2 polynucleotides, polypeptides, and/or agonists may be
utilized in a wide variety of surgical procedures. For example, within one aspect of the
present invention, METH1 and/or METH2 compositions (in the form of, for example, a
spray or film) may be utilized to coat or spray an area prior to removal of a tumor, in order
to isolate normal surrounding tissues from malignant tissue, and/or to prevent the spread of
disease to surrounding tissues. Within other aspects of the present invention, METH1 and/or
METH2 compositions (e.g., in the form of a spray) may be delivered via endoscopic
procedures in order to coat tumors, or inhibit angiogenesis in a desired locale. Within yet
other aspects of the present invention, surgical meshes which have been coated with
anti-angiogenic compositions of the present invention may be utilized in any procedure wherein
a surgical mesh might be utilized. For example, within one embodiment of the invention a
surgical mesh laden with an anti-angiogenic composition may be utilized during abdominal
cancer resection surgery (e.g., subsequent to colon resection) in order to provide support to
the structure, and to release an amount of the anti-angiogenic factor.

Within further aspects of the present invention, methods are provided for treating
tumor excision sites, comprising administering a METH1 and/or METH2 polynucleotide,
polypeptide, and/or agonist to the resection margins of a tumor subsequent to excision, such
that the local recurrence of cancer and the formation of new blood vessels at the site are
inhibited. Within one embodiment of the invention, the anti-angiogenic compound is
administered directly to the tumor excision site (e.g., applied by swabbing, brushing or
otherwise coating the resection margins of the tumor with the anti-angiogenic compound).
Alternatively, the anti-angiogenic compounds may be incorporated into known surgical
pastes prior to administration. Within particularly preferred embodiments of the invention,
the anti-angiogenic compounds are applied after hepatic resections for malignancy, and after
neurosurgical operations.

Within one aspect of the present invention, METH1 and/or METH2 polynucleotides,
polypeptides, and/or agonists may be administered to the resection margin of a wide variety
of tumors, including, for example, breast, colon, brain and hepatic tumors. For example,
within one embodiment of the invention, anti-angiogenic compounds may be administered
to the site of a neurological tumor subsequent to excision, such that the formation of new
blood vessels at the site is inhibited.

The METH1 and/or METH2 polynucleotides, polypeptides, and/or agonists of the
present invention may also be administered along with other anti-angiogenic factors.
Representative examples of other anti-angiogenic factors include: Anti-Invasive Factor,
retinoic acid and derivatives thereof, paclitaxel, Suramin, Tissue Inhibitor of
Metalloproteinase-1, Tissue Inhibitor of Metalloproteinase-2, Plasminogen Activator
Inhibitor-1, Plasminogen Activator Inhibitor-2, and various forms of the lighter "d group"
transition metals.

Lighter "d group" transition metals include, for example, vanadium, molybdenum,
tungsten, titanium, niobium, and tantalum species. Such transition metal species may form
transition metal complexes. Suitable complexes of the above-mentioned transition metal
species include oxo transition metal complexes.

Representative examples of vanadium complexes include oxo vanadium complexes
such as vanadate and vanadyl complexes. Suitable vanadate complexes include
metavanadate and orthovanadate complexes such as, for example, ammonium metavanadate,
sodium metavanadate, and sodium orthovanadate. Suitable vanadyl complexes include, for
example, vanadyl acetylacetonate and vanadyl sulfate including vanadyl sulfate hydrates
such as vanadyl sulfate mono- and trihydrates.

Representative examples of tungsten and molybdenum complexes also include oxo
complexes. Suitable oxo tungsten complexes include tungstate and tungsten oxide
complexes. Suitable tungstate complexes include ammonium tungstate, calcium tungstate,
sodium tungstate dihydrate, and tungstic acid. Suitable tungsten oxides include tungsten (IV)
oxide and tungsten (VI) oxide. Suitable oxo molybdenum complexes include molybdate,
molybdenum oxide, and molybdenyl complexes. Suitable molybdate complexes include
ammonium molybdate and its hydrates, sodium molybdate and its hydrates, and potassium
molybdate and its hydrates. Suitable molybdenum oxides include molybdenum (VI) oxide,
molybdenum (VI) oxide, and molybdic acid. Suitable molybdenyl complexes include, for
example, molybdenyl acetylacetonate. Other suitable tungsten and molybdenum complexes
include hydroxo derivatives derived from, for example, glycerol, tartaric acid, and sugars.

A wide variety of other anti-angiogenic factors may also be utilized within the
context of the present invention. Representative examples include platelet factor 4;
protamine sulphate; sulphated chitin derivatives (prepared from queen crab shells) (Murata
et al., Cancer Res. 51:22-26, 1991); Sulphated Polysaccharide Peptidoglycan Complex (SP-PG)
(the function of this compound may be enhanced by the presence of steroids such as
estrogen, and tamoxifen citrate); Staurosporine; modulators of matrix metabolism, including
for example, proline analogs, cishydroxyproline, d,L-3,4-dehydroproline, Thiaproline,
α,α-dipyridyl, β-aminopropionitrile fumarate; 4-propyl-5-(4-pyridinyl)-2(3H)-oxazolone;
Methotrexate; Mitoxantrone; Heparin; Interferons; 2 Macroglobulin-serum; ChIMP-3
(Pavloff et al., J. Bio. Chem. 267:17321-17326, 1992); Chymostatin (Tomkinson et al.,
Biochem J. 286:475-480, 1992); β-Cyclodextrin Tetradecasulfate; Eponemycin;
Camptothecin; Fumagillin (Ingber et al., Nature 348:555-557, 1990); Gold Sodium
Thiomalate ("GST"; Matsubara and Ziff, J. Clin. Invest. 79:1440-1446, 1987);
β-1-anticollagenase-serum; α2-antiplasmin (Holmes et al., J. Biol. Chem. 262(4):1659-1664,
1987); Bisantrene (National Cancer Institute); Lobenzarit disodium (N-(2)-carboxyphenyl-4-
chloroanthronilic acid disodium or "CCA"; Takeuchi et al., Agents Actions 36:312-316,
1992); Thalidomide; Angostatic steroid; AGM-1470; carboxynaminolmidazole; and
metalloproteinase inhibitors such as BB94.



Diagnostic Methods 

The invention also relates to the use of METH1 or METH2 polynucleotides as
diagnostic reagents. Detection of a mutated form of the METH1 or METH2 gene
associated with a dysfunction will provide a diagnostic tool that can add to or define a
diagnosis of a disease or susceptibility to a disease which results from under-expression,
overexpression or altered expression of METH1 or METH2. Individuals carrying mutations
in the METH1 or METH2 gene may be detected at the DNA level by a variety of techniques.

Nucleic acids for diagnosis may be obtained from a subject's cells, such as from
blood, urine, saliva, tissue biopsy or autopsy material. The genomic DNA may be used
directly for detection or may be amplified enzymatically by using PCR or other amplification
techniques prior to analysis. RNA or cDNA may also be used in similar fashion. Deletions
and insertions can be detected by a change in size of the amplified product in comparison
to the normal genotype. Point mutations can be identified by hybridizing amplified DNA
to labeled METH1 or METH2 nucleotide sequences. Perfectly matched sequences can be
distinguished from mismatched duplexes by RNase digestion or by differences in melting
temperatures. DNA sequence differences may also be detected by alterations in
electrophoretic mobility of DNA fragments in gels, with or without denaturing agents, or by
direct DNA sequencing. See, e.g., Myers et al., Science 230:1242 (1985). Sequence
changes at specific locations may also be revealed by nuclease protection assays, such as
RNase and S1 protection or the chemical cleavage method. See Cotton et al., Proc. Natl.
Acad. Sci. USA 85:4397-4401 (1985). In another embodiment, an array of oligonucleotide
probes comprising METH1 or METH2 nucleotide sequence or fragments thereof can be
constructed to conduct efficient screening of, e.g., genetic mutations. Array technology
methods are well known and have general applicability and can be used to address a variety
of questions in molecular genetics including gene expression, genetic linkage, and genetic
variability. (See, for example, M. Chee et al., Science 274:610-613 (1996).)
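
The size comparison described above for deletions and insertions can be illustrated with a short Python sketch. It is offered only as an illustration: the function name `classify_amplicon`, the `tolerance_bp` parameter and the fragment lengths used in the usage note are assumptions introduced here and do not come from this specification.

```python
def classify_amplicon(observed_bp: int, normal_bp: int, tolerance_bp: int = 0) -> str:
    """Compare an amplified product's length with the normal-genotype product.

    tolerance_bp is a hypothetical allowance for sizing error on a gel.
    """
    if observed_bp <= 0 or normal_bp <= 0:
        raise ValueError("fragment lengths must be positive")
    difference = observed_bp - normal_bp
    if abs(difference) <= tolerance_bp:
        # Same apparent size: a point mutation, if present, would have to be
        # revealed by hybridization, RNase digestion, melting analysis or sequencing.
        return "no size change"
    return "insertion" if difference > 0 else "deletion"
```

For instance, under these assumptions a 480 bp product compared with a 500 bp normal product would be reported as a deletion, while a product of unchanged size would be passed on to the hybridization or sequencing methods described above.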

The diagnostic assays offer a process for diagnosing or determining a susceptibility
to angiogenic diseases (cancer, cancer metastasis, chronic inflammatory disorders,
rheumatoid arthritis, atherosclerosis, macular degeneration, diabetic retinopathy), restenosis,
Alzheimer's disease and tissue remodeling through detection of mutation in the METH1 or
METH2 gene by the methods described.

In addition, angiogenic diseases (cancer, cancer metastasis, chronic inflammatory
disorders, rheumatoid arthritis, atherosclerosis, macular degeneration, diabetic retinopathy),
restenosis, Alzheimer's disease and tissue remodeling can be diagnosed by methods
comprising determining from a sample derived from a subject an abnormally decreased or
increased level of the METH1 or METH2 polypeptide or METH1 or METH2 mRNA.
Decreased or increased expression can be measured at the RNA level using any of the
methods well known in the art for the quantitation of polynucleotides, such as, for example,
PCR, RT-PCR, RNase protection, Northern blotting and other hybridization methods. Assay
techniques that can be used to determine levels of a protein, such as a METH1 or METH2
polypeptide, in a sample derived from a host are well-known to those of skill in the art.
Such assay methods include radioimmunoassays, competitive-binding assays, Western Blot
analysis and ELISA assays.

Cancer Diagnosis and Prognosis



It is believed that certain tissues in mammals with cancer express significantly
diminished levels of the METH1 or METH2 protein and mRNA encoding the METH1 or
METH2 protein when compared to a corresponding "standard" mammal, i.e., a mammal of
the same species not having the cancer. Further, it is believed that diminished levels of the
METH1 or METH2 protein can be detected in certain body fluids (e.g., sera, plasma, urine,
and spinal fluid) from mammals with cancer when compared to sera from mammals of the
same species not having the cancer. Thus, the invention provides a diagnostic method useful
during tumor diagnosis, which involves assaying the expression level of the gene encoding
the METH1 protein in mammalian cells or body fluid and comparing the gene expression
level with a standard METH1 gene expression level, whereby a decrease in the gene
expression level under the standard is indicative of certain tumors. The invention also
provides a diagnostic method useful during tumor diagnosis, which involves assaying the
expression level of the gene encoding the METH2 protein in mammalian cells or body fluid
and comparing the gene expression level with a standard METH2 gene expression level,
whereby a decrease in the gene expression level under the standard is indicative of certain
tumors.

Where a tumor diagnosis has already been made according to conventional methods,
the present invention is useful as a prognostic indicator, whereby patients exhibiting
diminished METH1 or METH2 gene expression will experience a worse clinical outcome
relative to patients expressing the gene at a higher level.

By "assaying the expression level of the gene encoding the METH1 or METH2
protein" is intended qualitatively or quantitatively measuring or estimating the level of the
METH1 or METH2 protein or the level of the mRNA encoding the METH1 or METH2
protein in a first biological sample either directly (e.g., by determining or estimating absolute
protein level or mRNA level) or relatively (e.g., by comparing to the METH1 or METH2
protein level or mRNA level in a second biological sample).

Preferably, the METH1 or METH2 protein level or mRNA level in the first
biological sample is measured or estimated and compared to a standard METH1 or METH2
protein level or mRNA level, the standard being taken from a second biological sample
obtained from an individual not having the cancer. As will be appreciated in the art, once
a standard METH1 or METH2 protein level or mRNA level is known, it can be used
repeatedly as a standard for comparison.
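
The relative comparison described above, in which a measured level is compared against a reusable standard, might be sketched as follows. This is a minimal illustration under stated assumptions: the function names, the arbitrary units, the example readings and in particular the 0.5 cutoff are hypothetical and are not values taught by this specification.

```python
def relative_level(sample_level: float, standard_level: float) -> float:
    """Return the sample level as a fraction of the standard level."""
    if standard_level <= 0:
        raise ValueError("standard level must be positive")
    if sample_level < 0:
        raise ValueError("sample level cannot be negative")
    return sample_level / standard_level


def is_diminished(sample_level: float, standard_level: float, cutoff: float = 0.5) -> bool:
    """Flag a sample whose level falls below a chosen fraction of the standard.

    The cutoff is a hypothetical parameter; an appropriate value would have to
    be established experimentally.
    """
    return relative_level(sample_level, standard_level) < cutoff


# A standard determined once (arbitrary units) is reused for every comparison.
STANDARD_LEVEL = 100.0
readings = {"sample_a": 35.0, "sample_b": 96.0}  # hypothetical measurements
flags = {name: is_diminished(level, STANDARD_LEVEL) for name, level in readings.items()}
```

Keeping the standard as a single stored value mirrors the statement above that a known standard level can be used repeatedly.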

By "biological sample" is intended any biological sample obtained from an
individual, cell line, tissue culture, or other source which contains METH1 or METH2
protein or mRNA. Biological samples include mammalian body fluids (such as sera,
plasma, urine, synovial fluid and spinal fluid) which contain secreted mature METH1 or
METH2 protein, and adrenal, thyroid, stomach, brain, heart, placenta, lung, liver, muscle,
kidney, pancreas, testis and ovarian tissue (for METH1); and prostate, small intestine, colon,
brain and lung tissue (for METH2).

The present invention is useful for detecting cancer in mammals. In particular the
invention is useful during diagnosis of the following types of cancers in mammals: breast,
ovarian, prostate, liver, lung, pancreatic, colon, and testicular. Preferred mammals include
monkeys, apes, cats, dogs, cows, pigs, horses, rabbits and humans. Particularly preferred
are humans.

Total cellular RNA can be isolated from a biological sample using the single-step
guanidinium-thiocyanate-phenol-chloroform method described in Chomczynski and Sacchi,
Anal. Biochem. 162:156-159 (1987). Levels of mRNA encoding the METH1 or METH2
protein are then assayed using any appropriate method. These include Northern blot analysis
(Harada et al., Cell 63:303-312 (1990)), S1 nuclease mapping (Fujita et al., Cell 49:351-367
(1987)), the polymerase chain reaction (PCR), reverse transcription in combination with
the polymerase chain reaction (RT-PCR) (Makino et al., Technique 2:295-301 (1990)), and
reverse transcription in combination with the ligase chain reaction (RT-LCR).

Assaying METH1 or METH2 protein levels in a biological sample can occur using
antibody-based techniques. For example, METH1 or METH2 protein expression in tissues
can be studied with classical immunohistological methods (Jalkanen, M., et al., J. Cell.
Biol. 101:976-985 (1985); Jalkanen, M., et al., J. Cell. Biol. 105:3081-3096 (1987)).

Other antibody-based methods useful for detecting METH1 or METH2 protein gene
expression include immunoassays, such as the enzyme linked immunosorbent assay (ELISA)
and the radioimmunoassay (RIA).

Suitable labels are known in the art and include enzyme labels, such as glucose
oxidase, and radioisotopes, such as iodine (¹²⁵I, ¹²¹I), carbon (¹⁴C), sulfur (³⁵S), tritium (³H),
indium (¹¹²In), and technetium (⁹⁹ᵐTc), and fluorescent labels, such as fluorescein and
rhodamine, and biotin.

Vaccines 

Another aspect of the invention relates to a method for inducing an immunological
response in a mammal which comprises inoculating the mammal with METH1 or METH2
polypeptide, or a fragment thereof, adequate to produce antibody and/or T cell immune
response to protect said animal from angiogenic diseases (cancer, cancer metastasis, chronic
inflammatory disorders, rheumatoid arthritis, atherosclerosis, macular degeneration,
diabetic retinopathy), restenosis, Alzheimer's disease and tissue remodeling, among others.

Yet another aspect of the invention relates to a method of inducing an immunological response
in a mammal which comprises delivering METH1 or METH2 polypeptide via a vector
directing expression of METH1 or METH2 polynucleotide in vivo in order to induce such
an immunological response to produce antibody to protect such animal from diseases.

A further aspect of the invention relates to an immunological/vaccine formulation
(composition) which, when introduced into a mammalian host, induces an immunological
response in that mammal to a METH1 or METH2 polypeptide wherein the composition
comprises a METH1 or METH2 polypeptide or METH1 or METH2 gene. The vaccine
formulation may further comprise a suitable carrier. Since METH1 or METH2 polypeptide
may be broken down in the stomach, it is preferably administered parenterally (including
subcutaneous, intramuscular, intravenous, intradermal etc. injection). Formulations suitable
for parenteral administration include aqueous and non-aqueous sterile injection solutions
which may contain anti-oxidants, buffers, bacteriostats and solutes which render the
formulation isotonic with the blood of the recipient; and aqueous and non-aqueous sterile
suspensions which may include suspending agents or thickening agents. The formulations
may be presented in unit-dose or multi-dose containers, for example, sealed ampules and
vials and may be stored in a freeze-dried condition requiring only the addition of the sterile
liquid carrier immediately prior to use. The vaccine formulation may also include adjuvant
systems for enhancing the immunogenicity of the formulation, such as oil-in-water systems
and other systems known in the art. The dosage will depend on the specific activity of the
vaccine and can be readily determined by routine experimentation.

Modes of Administration

It is recognized that an increase in the vascular supply plays a central role in tumor
progression and metastasis; therefore, inhibitors of angiogenesis can prove effective as
adjuvant therapy for cancer patients. Some of the currently recognized angiogenic
suppressors are poor candidates for systemic treatment due to severe collateral effects. The
present inventors have found that METH1 and METH2 are potent inhibitors of angiogenesis
both in vitro and in vivo. The advantage of METH1 and METH2 is that these inhibitors are
normally associated with suppression of physiological angiogenesis; therefore, they offer
lack of toxicity and endothelial specificity as advantages over other angiogenic inhibitors.
Furthermore, METH1 and METH2 present a restricted pattern of expression, providing a
possible advantage in organ specificity.

Accordingly, the polypeptides of the present invention may be employed to treat
cancer. The METH1 and METH2 polypeptides of the present invention can also be used
to treat individuals with other disorders that are related to angiogenesis, including abnormal
wound healing, inflammation, rheumatoid arthritis, psoriasis, endometrial bleeding
disorders, diabetic retinopathy, some forms of macular degeneration, hemangiomas, and
arterial-venous malformations.

Thus, the invention provides a method of inhibiting angiogenesis in an individual
comprising administering to such an individual a pharmaceutical composition comprising
an effective amount of an isolated METH1 polypeptide of the invention, effective to increase
the METH1 activity level in such an individual. The invention also provides a method of
inhibiting angiogenesis in an individual comprising administering to such an individual a
pharmaceutical composition comprising an effective amount of an isolated METH2
polypeptide of the invention, effective to increase the METH2 activity level in such an
individual.

METH1 polypeptides which may be used to inhibit angiogenesis in this manner
include: the METH1 polypeptide encoded by the deposited cDNA including the leader; the
mature METH1 polypeptide encoded by the deposited cDNA minus the leader (i.e., the
mature protein); a polypeptide comprising amino acids about 1 to about 950 in SEQ ID
NO:2; a polypeptide comprising amino acids about 2 to about 950 in SEQ ID NO:2; a
polypeptide comprising amino acids about 29 to about 950 in SEQ ID NO:2; a polypeptide
comprising amino acids about 30 to about 950 in SEQ ID NO:2; a polypeptide comprising
the metalloprotease domain of METH1, amino acids 235 to 459 in SEQ ID NO:2; a
polypeptide comprising the disintegrin domain of METH1, amino acids 460 to 544 in SEQ
ID NO:2; a polypeptide comprising the first TSP-like domain of METH1, amino acids 545
to 598 in SEQ ID NO:2; a polypeptide comprising the second TSP-like domain of METH1,
amino acids 841 to 894 in SEQ ID NO:2; a polypeptide comprising the third TSP-like
domain of METH1, amino acids 895 to 934 in SEQ ID NO:2; a polypeptide comprising
amino acids 536 to 613 in SEQ ID NO:2; a polypeptide comprising amino acids 549 to 563
in SEQ ID NO:2; a polypeptide comprising amino acids 542 to 894 of SEQ ID NO:2; and
a polypeptide comprising amino acids 801 to 950 of SEQ ID NO:2.

METH2 polypeptides which may be used to inhibit angiogenesis in this manner
include: the METH2 polypeptide encoded by the deposited cDNA including the leader; the
mature METH2 polypeptide encoded by the deposited cDNA minus the leader (i.e., the
mature protein); a polypeptide comprising amino acids about 1 to about 890 in SEQ ID
NO:4; a polypeptide comprising amino acids about 2 to about 890 in SEQ ID NO:4; a
polypeptide comprising amino acids about 24 to about 890 in SEQ ID NO:4; a polypeptide
comprising amino acids about 112 to about 890 in SEQ ID NO:4; a polypeptide comprising
the metalloprotease domain of METH2, amino acids 214 to 439 in SEQ ID NO:4; a
polypeptide comprising the disintegrin domain of METH2, amino acids 440 to 529 in SEQ
ID NO:4; a polypeptide comprising the first TSP-like domain of METH2, amino acids 530
to 583 in SEQ ID NO:4; a polypeptide comprising the second TSP-like domain of METH2,
amino acids 837 to 890 in SEQ ID NO:4; a polypeptide comprising amino acids 280 to 606
in SEQ ID NO:4; and a polypeptide comprising amino acids 529 to 548 in SEQ ID NO:4.

Also included are METH1 or METH2 proteins lacking TSP3; a METH1 or METH2
protein lacking TSP2 and TSP3; a METH1 or METH2 protein lacking TSP3, TSP2, and
TSP1; a METH1 or METH2 protein lacking the cysteine-rich domain, TSP1, TSP2, and
TSP3; a METH1 or METH2 protein lacking the metalloprotease domain, the cysteine-rich
domain, TSP1, TSP2 and TSP3; and a METH1 or METH2 protein lacking the prodomain,
the metalloprotease domain, the cysteine-rich domain, TSP1, TSP2, and TSP3. Finally, any
combination of these domains is also preferred. For example, the cysteine-rich domain of
METH1 may be combined with 1, 2, or 3 TSP domains of METH1. The cysteine-rich
domain of METH2 may be combined with 1, 2, or 3 TSP domains of METH2. The
metalloprotease domain and the cysteine-rich domain of METH1 may be combined with 1, 2,
or 3 TSP domains of METH1. The metalloprotease domain and the cysteine-rich domain
of METH2 may be combined with 1, 2, or 3 TSP domains of METH2. The prodomain, the
metalloprotease domain, and the cysteine-rich domain of METH1 may be combined with
1, 2, or 3 TSP domains of METH1. The prodomain, the metalloprotease domain, and the
cysteine-rich domain of METH2 may be combined with 1, 2, or 3 TSP domains of METH2.
The signal sequence, the prodomain, the metalloprotease domain, and the cysteine-rich
domain of METH1 may be combined with 1, 2, or 3 TSP domains of METH1. The signal
sequence, the prodomain, the metalloprotease domain, and the cysteine-rich domain of
METH2 may be combined with 1, 2, or 3 TSP domains of METH2.
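
The domain boundaries recited above lend themselves to a simple lookup for selecting fragments. The following Python sketch is illustrative only. The residue ranges are those stated in the text for SEQ ID NO:2 and SEQ ID NO:4; the names `DOMAINS`, `domain_length` and `extract_domain` are introduced here, the labels TSP1, TSP2 and TSP3 are assumed to correspond to the first, second and third TSP-like domains, and the sequence used in the usage note is a placeholder rather than an actual METH sequence.

```python
# Residue ranges are 1-based and inclusive, as recited in the text.
DOMAINS = {
    "METH1": {  # positions in SEQ ID NO:2
        "metalloprotease": (235, 459),
        "disintegrin": (460, 544),
        "TSP1": (545, 598),
        "TSP2": (841, 894),
        "TSP3": (895, 934),
    },
    "METH2": {  # positions in SEQ ID NO:4
        "metalloprotease": (214, 439),
        "disintegrin": (440, 529),
        "TSP1": (530, 583),
        "TSP2": (837, 890),
    },
}


def domain_length(protein: str, domain: str) -> int:
    """Number of residues in the named domain."""
    start, end = DOMAINS[protein][domain]
    return end - start + 1


def extract_domain(sequence: str, protein: str, domain: str) -> str:
    """Return the residues of the named domain from a full-length sequence."""
    start, end = DOMAINS[protein][domain]
    if end > len(sequence):
        raise ValueError("sequence is shorter than the requested domain range")
    # Convert the 1-based inclusive range to a 0-based half-open slice.
    return sequence[start - 1:end]
```

As a usage example, calling `extract_domain("X" * 950, "METH1", "metalloprotease")` on a 950-residue placeholder returns a 225-residue fragment, matching the span of amino acids 235 to 459.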

As a general proposition, the total pharmaceutically effective amount of METH1 or
METH2 polypeptide administered parenterally per dose will be in the range of about 1
µg/kg/day to 10 mg/kg/day of patient body weight, although, as noted above, this will be
subject to therapeutic discretion. More preferably, this dose is at least 0.01 mg/kg/day, and
most preferably for humans between about 0.01 and 1 mg/kg/day for the polypeptide. If
given continuously, the METH1 or METH2 polypeptide is typically administered at a dose
rate of about 1 µg/kg/hour to about 50 µg/kg/hour, either by 1-4 injections per day or by
continuous subcutaneous infusions, for example, using a mini-pump. An intravenous bag
solution may also be employed.
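
The weight-based arithmetic implied by the ranges above (about 1 µg/kg/day to 10 mg/kg/day per dose, and about 1 µg/kg/hour to about 50 µg/kg/hour when given continuously) can be sketched as follows. This is only an arithmetic illustration: the function names are introduced here, the ranges in the text are approximate ("about") while the code treats them as hard limits for simplicity, and the 70 kg body weight in the usage note is an assumed example.

```python
UG_PER_MG = 1000.0


def daily_dose_mg(weight_kg: float, dose_mg_per_kg_day: float) -> float:
    """Total daily amount in mg for a weight-based parenteral dose."""
    if weight_kg <= 0:
        raise ValueError("body weight must be positive")
    # Broad range from the text: 1 ug/kg/day (0.001 mg/kg/day) to 10 mg/kg/day.
    if not 0.001 <= dose_mg_per_kg_day <= 10.0:
        raise ValueError("dose lies outside the stated 1 ug/kg/day to 10 mg/kg/day range")
    return weight_kg * dose_mg_per_kg_day


def infusion_total_mg(weight_kg: float, rate_ug_per_kg_hour: float, hours: float = 24.0) -> float:
    """Total amount in mg delivered by a continuous infusion over the given hours."""
    if weight_kg <= 0 or hours <= 0:
        raise ValueError("body weight and duration must be positive")
    # Continuous-administration range from the text: 1 to 50 ug/kg/hour.
    if not 1.0 <= rate_ug_per_kg_hour <= 50.0:
        raise ValueError("rate lies outside the stated 1 to 50 ug/kg/hour range")
    return weight_kg * rate_ug_per_kg_hour * hours / UG_PER_MG
```

For an assumed 70 kg individual, 0.01 mg/kg/day corresponds to roughly 0.7 mg per day, and a continuous infusion at 1 µg/kg/hour delivers roughly 1.68 mg over 24 hours.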

Pharmaceutical compositions containing the METH1 or METH2 of the invention
may be administered orally, rectally, parenterally, intracisternally, intravaginally,
intraperitoneally, topically (as by powders, ointments, drops or transdermal patch), buccally,
or as an oral or nasal spray. By "pharmaceutically acceptable carrier" is meant a non-toxic 
solid, semisolid or liquid filler, diluent, encapsulating material or formulation auxiliary of 
any type. The term "parenteral" as used herein refers to modes of administration which 
include intravenous, intramuscular, intraperitoneal, intrasternal, subcutaneous and 
intraarticular injection and infusion. 

Gene Therapy Methods 

Another aspect of the present invention relates to gene therapy methods for treating
disorders, diseases and conditions. The gene therapy methods relate to the introduction of
nucleic acid (DNA, RNA and antisense DNA or RNA) sequences into an animal to achieve
expression of the METH1 and/or METH2 polypeptide of the present invention. This method
requires a polynucleotide which codes for a METH1 and/or METH2 polypeptide operatively
linked to a promoter and any other genetic elements necessary for the expression of the
polypeptide by the target tissue. Such gene therapy and delivery techniques are known in the
art, see, for example, WO90/11092, which is herein incorporated by reference.

Thus, for example, cells from a patient may be engineered with a polynucleotide
(DNA or RNA) comprising a promoter operably linked to a METH1 and/or METH2
polynucleotide ex vivo, with the engineered cells then being provided to a patient to be
treated with the polypeptide. Such methods are well-known in the art. For example, see
Belldegrun, A., et al., J. Natl. Cancer Inst. 85:207-216 (1993); Ferrantini, M. et al., Cancer
Research 53:1107-1112 (1993); Ferrantini, M. et al., J. Immunology 153:4604-4615 (1994);
Kaido, T., et al., Int. J. Cancer 60:221-229 (1995); Ogura, H., et al., Cancer Research
50:5102-5106 (1990); Santodonato, L., et al., Human Gene Therapy 7:1-10 (1996);
Santodonato, L., et al., Gene Therapy 4:1246-1255 (1997); and Zhang, J.-F. et al., Cancer
Gene Therapy 3:31-38 (1996), which are herein incorporated by reference. In one
embodiment, the cells which are engineered are arterial cells. The arterial cells may be
reintroduced into the patient through direct injection to the artery, the tissues surrounding
the artery, or through catheter injection.

As discussed in more detail below, the METH1 and/or METH2 polynucleotide
constructs can be delivered by any method that delivers injectable materials to the cells of
an animal, such as injection into the interstitial space of tissues (heart, muscle, skin, lung,
liver, and the like). The METH1 and/or METH2 polynucleotide constructs may be delivered
in a pharmaceutically acceptable liquid or aqueous carrier.

In one embodiment, the METH1 and/or METH2 polynucleotide is delivered as a
naked polynucleotide. The term "naked" polynucleotide, DNA or RNA refers to sequences
that are free from any delivery vehicle that acts to assist, promote or facilitate entry into the
cell, including viral sequences, viral particles, liposome formulations, lipofectin or
precipitating agents and the like. However, the METH1 and/or METH2 polynucleotides can
also be delivered in liposome formulations, lipofectin formulations and the like, which can be
prepared by methods well known to those skilled in the art. Such methods are described, for
example, in U.S. Patent Nos. 5,593,972, 5,589,466, and 5,580,859, which are herein
incorporated by reference.

The METH1 and/or METH2 polynucleotide vector constructs used in the gene
therapy method are preferably constructs that will not integrate into the host genome nor will
they contain sequences that allow for replication. Appropriate vectors include pWLNEO,
pSV2CAT, pOG44, pXT1 and pSG available from Stratagene; pSVK3, pBPV, pMSG and
pSVL available from Pharmacia; and pEF1/V5, pcDNA3.1, and pRc/CMV2 available from
Invitrogen. Other suitable vectors will be readily apparent to the skilled artisan.

Any strong promoter known to those skilled in the art can be used for driving the
expression of METH1 and/or METH2 DNA. Suitable promoters include adenoviral
promoters, such as the adenoviral major late promoter; or heterologous promoters, such as
the cytomegalovirus (CMV) promoter; the respiratory syncytial virus (RSV) promoter;
inducible promoters, such as the MMT promoter, the metallothionein promoter; heat shock
promoters; the albumin promoter; the ApoAI promoter; human globin promoters; viral
thymidine kinase promoters, such as the Herpes Simplex thymidine kinase promoter;
retroviral LTRs; the β-actin promoter; and human growth hormone promoters. The
promoter also may be the native promoter for METH1 and/or METH2.

Unlike other gene therapy techniques, one major advantage of introducing naked 
nucleic acid sequences into target cells is the transitory nature of the polynucleotide 
synthesis in the cells. Studies have shown that non-replicating DNA sequences can be 
introduced into cells to provide production of the desired polypeptide for periods of up to 
six months. 

The METH1 and/or METH2 polynucleotide construct can be delivered to the
interstitial space of tissues within an animal, including muscle, skin, brain, lung, liver,
spleen, bone marrow, thymus, heart, lymph, blood, bone, cartilage, pancreas, kidney, gall
bladder, stomach, intestine, testis, ovary, uterus, rectum, nervous system, eye, gland, and
connective tissue. Interstitial space of the tissues comprises the intercellular, fluid,
mucopolysaccharide matrix among the reticular fibers of organ tissues, elastic fibers in the
walls of vessels or chambers, collagen fibers of fibrous tissues, or that same matrix within
connective tissue ensheathing muscle cells or in the lacunae of bone. It is similarly the space
occupied by the plasma of the circulation and the lymph fluid of the lymphatic channels.
Delivery to the interstitial space of muscle tissue is preferred for the reasons discussed below.
The constructs may be conveniently delivered by injection into the tissues comprising these
cells. They are preferably delivered to and expressed in persistent, non-dividing cells which are
differentiated, although delivery and expression may be achieved in non-differentiated or less
completely differentiated cells, such as, for example, stem cells of blood or skin fibroblasts.
In vivo muscle cells are particularly competent in their ability to take up and express
polynucleotides.

For the naked nucleic acid sequence injection, an effective dosage amount of DNA or RNA
will be in the range of from about 0.05 mg/kg body weight to about 50 mg/kg body weight. 
Preferably the dosage will be from about 0.005 mg/kg to about 20 mg/kg and more preferably 
from about 0.05 mg/kg to about 5 mg/kg. Of course, as the artisan of ordinary skill will 
appreciate, this dosage will vary according to the tissue site of injection. The appropriate and 
effective dosage of nucleic acid sequence can readily be determined by those of ordinary skill 
in the art and may depend on the condition being treated and the route of administration. 

The preferred route of administration is by the parenteral route of injection into the 
interstitial space of tissues. However, other parenteral routes may also be used, such as, 
inhalation of an aerosol formulation particularly for delivery to lungs or bronchial tissues, 
throat or mucous membranes of the nose. In addition, naked METH1 and/or METH2 DNA
constructs can be delivered to arteries during angioplasty by the catheter used in the 
procedure. 

The naked polynucleotides are delivered by any method known in the art, including,
but not limited to, direct needle injection at the delivery site, intravenous injection, topical 
administration, catheter infusion, and so-called "gene guns". These delivery methods are 
known in the art. 

The constructs may also be delivered with delivery vehicles such as viral sequences, 
viral particles, liposome formulations, lipofectin, precipitating agents, etc. Such methods of 
delivery are known in the art. 

In certain embodiments, the METH1 and/or METH2 polynucleotide constructs are
complexed in a liposome preparation. Liposomal preparations for use in the instant
invention include cationic (positively charged), anionic (negatively charged) and neutral
preparations. However, cationic liposomes are particularly preferred because a tight charge
complex can be formed between the cationic liposome and the polyanionic nucleic acid.
Cationic liposomes have been shown to mediate intracellular delivery of plasmid DNA
(Felgner et al., Proc. Natl. Acad. Sci. USA (1987) 84:7413-7416, which is herein
incorporated by reference); mRNA (Malone et al., Proc. Natl. Acad. Sci. USA (1989)
86:6077-6081, which is herein incorporated by reference); and purified transcription factors
(Debs et al., J. Biol. Chem. (1990) 265:10189-10192, which is herein incorporated by
reference), in functional form.

Cationic liposomes are readily available. For example, N-[1-(2,3-dioleyloxy)propyl]-
N,N,N-triethylammonium (DOTMA) liposomes are particularly useful and are available
under the trademark Lipofectin, from GIBCO BRL, Grand Island, N.Y. (See, also, Felgner
et al., Proc. Natl. Acad. Sci. USA (1987) 84:7413-7416, which is herein incorporated by
reference). Other commercially available liposomes include Transfectace (DDAB/DOPE) and
DOTAP/DOPE (Boehringer).

Other cationic liposomes can be prepared from readily available materials using 
techniques well known in the art. See, e.g., PCT Publication No. WO 90/11092 (which is 
herein incorporated by reference) for a description of the synthesis of DOTAP 
(1,2-bis(oleoyloxy)-3-(trimethylammonio)propane) liposomes. Preparation of DOTMA 
liposomes is explained in the literature; see, e.g., P. Felgner et al., Proc. Natl. Acad. Sci. 
USA 84:7413-7417, which is herein incorporated by reference. Similar methods can be used 
to prepare liposomes from other cationic lipid materials. 

Similarly, anionic and neutral liposomes are readily available, such as from Avanti 
Polar Lipids (Birmingham, Ala.), or can be easily prepared using readily available materials. 
Such materials include phosphatidyl choline, cholesterol, phosphatidyl ethanolamine, 
dioleoylphosphatidyl choline (DOPC), dioleoylphosphatidyl glycerol (DOPG), and 
dioleoylphosphatidyl ethanolamine (DOPE), among others. These materials can also be mixed 
with the DOTMA and DOTAP starting materials in appropriate ratios. Methods for making 
liposomes using these materials are well known in the art. 

For example, commercially available dioleoylphosphatidyl choline (DOPC), 
dioleoylphosphatidyl glycerol (DOPG), and dioleoylphosphatidyl ethanolamine (DOPE) can 
be used in various combinations to make conventional liposomes, with or without the 
addition of cholesterol. Thus, for example, DOPG/DOPC vesicles can be prepared by drying 
50 mg each of DOPG and DOPC under a stream of nitrogen gas into a sonication vial. The 
sample is placed under a vacuum pump overnight and is hydrated the following day with 
deionized water. The sample is then sonicated for 2 hours in a capped vial, using a Heat 
Systems model 350 sonicator equipped with an inverted cup (bath type) probe at the 
maximum setting while the bath is circulated at 15°C. Alternatively, negatively charged 
vesicles can be prepared without sonication to produce multilamellar vesicles or by 
extrusion through nucleopore membranes to produce unilamellar vesicles of discrete size. 
Other methods are known and available to those of skill in the art. 

The liposomes can comprise multilamellar vesicles (MLVs), small unilamellar 
vesicles (SUVs), or large unilamellar vesicles (LUVs), with SUVs being preferred. The 
various liposome-nucleic acid complexes are prepared using methods well known in the art. 
See, e.g., Straubinger et al., Methods of Immunology (1983), 101:512-527, which is herein 
incorporated by reference. For example, MLVs containing nucleic acid can be prepared by 
depositing a thin film of phospholipid on the walls of a glass tube and subsequently 
hydrating with a solution of the material to be encapsulated. SUVs are prepared by extended 
sonication of MLVs to produce a homogeneous population of unilamellar liposomes. The 
material to be entrapped is added to a suspension of preformed MLVs and then sonicated. 
When using liposomes containing cationic lipids, the dried lipid film is resuspended in an 
appropriate solution such as sterile water or an isotonic buffer solution such as 10 mM 
Tris/NaCl, sonicated, and then the preformed liposomes are mixed directly with the DNA. 
The liposome and DNA form a very stable complex due to binding of the positively charged 
liposomes to the polyanionic DNA. SUVs find use with small nucleic acid fragments. LUVs are 
prepared by a number of methods well known in the art. Commonly used methods include 
Ca²⁺-EDTA chelation (Papahadjopoulos et al., Biochim. Biophys. Acta (1975) 394:483; 
Wilson et al., Cell (1979) 17:77); ether injection (Deamer, D. and Bangham, A., Biochim. 
Biophys. Acta (1976) 443:629; Ostro et al., Biochem. Biophys. Res. Commun. (1977) 
76:836; Fraley et al., Proc. Natl. Acad. Sci. USA (1979) 76:3348); detergent dialysis (Enoch, 
H. and Strittmatter, P., Proc. Natl. Acad. Sci. USA (1979) 76:145); and reverse-phase 
evaporation (REV) (Fraley et al., J. Biol. Chem. (1980) 255:10431; Szoka, F. and 
Papahadjopoulos, D., Proc. Natl. Acad. Sci. USA (1978) 75:145; Schaefer-Ridder et al., 
Science (1982) 215:166), which are herein incorporated by reference. 

Generally, the ratio of DNA to liposomes will be from about 10:1 to about 1:10. 
Preferably, the ratio will be from about 5:1 to about 1:5. More preferably, the ratio will be 
about 3:1 to about 1:3. Still more preferably, the ratio will be about 1:1. 
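
By way of illustration only, the ratio ranges above can be turned into a range of liposome amounts for a given amount of DNA. The Python sketch below assumes the ratios are expressed on a weight-for-weight basis, which the text does not state; the function name and tier labels are hypothetical. 

```python
from fractions import Fraction

# DNA:liposome ratio ranges from the text, stored as
# (highest DNA:liposome ratio, lowest DNA:liposome ratio).
RATIO_RANGES = {
    "general": (Fraction(10, 1), Fraction(1, 10)),
    "preferred": (Fraction(5, 1), Fraction(1, 5)),
    "more_preferred": (Fraction(3, 1), Fraction(1, 3)),
    "most_preferred": (Fraction(1, 1), Fraction(1, 1)),
}


def liposome_range(dna_amount, tier="general"):
    """Return (min, max) liposome amount in the same units as dna_amount.

    Assumes a weight-for-weight ratio (an assumption, not stated in the text).
    """
    high, low = RATIO_RANGES[tier]
    amount = Fraction(dna_amount)
    # The most DNA-rich ratio needs the least liposome, and vice versa.
    return (amount / high, amount / low)
```

For example, `liposome_range(10, "preferred")` gives the bounds for 10 units of DNA at the 5:1 to 1:5 tier. 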

U.S. Patent No. 5,676,954 (which is herein incorporated by reference) reports on the 
injection of genetic material, complexed with cationic liposome carriers, into mice. U.S. 
Patent Nos. 4,897,355, 4,946,787, 5,049,386, 5,459,127, 5,589,466, 5,693,622, 5,580,859, 
5,703,055, and international publication no. WO 94/29469 (which are herein incorporated 
by reference) provide cationic lipids for use in transfecting DNA into cells and mammals. 
U.S. Patent Nos. 5,589,466, 5,693,622, 5,580,859, 5,703,055, and international publication 
no. WO 94/29469 (which are herein incorporated by reference) provide methods for 
delivering DNA-cationic lipid complexes to mammals. 

In certain embodiments, cells are engineered, ex vivo or in vivo, using a retroviral 
particle containing RNA which comprises a sequence encoding METH1 and/or METH2. 
Retroviruses from which the retroviral plasmid vectors may be derived include, but are not 
limited to, Moloney Murine Leukemia Virus, spleen necrosis virus, Rous sarcoma Virus, 
Harvey Sarcoma Virus, avian leukosis virus, gibbon ape leukemia virus, human 
immunodeficiency virus, Myeloproliferative Sarcoma Virus, and mammary tumor virus. 

The retroviral plasmid vector is employed to transduce packaging cell lines to form 
producer cell lines. Examples of packaging cells which may be transfected include, but are 
not limited to, the PE501, PA317, ψ-2, ψ-AM, PA12, T19-14X, VT-19-17-H2, ψCRE, 
ψCRIP, GP+E-86, GP+envAm12, and DAN cell lines as described in Miller, Human Gene 
Therapy 1:5-14 (1990), which is incorporated herein by reference in its entirety. The vector 
may transduce the packaging cells through any means known in the art. Such means 
include, but are not limited to, electroporation, the use of liposomes, and CaPO4 
precipitation. In one alternative, the retroviral plasmid vector may be encapsulated into a 
liposome, or coupled to a lipid, and then administered to a host. 

The producer cell line generates infectious retroviral vector particles which include 
polynucleotide encoding METH1 and/or METH2. Such retroviral vector particles then may 
be employed to transduce eukaryotic cells, either in vitro or in vivo. The transduced 
eukaryotic cells will express METH1 and/or METH2. 

In certain other embodiments, cells are engineered, ex vivo or in vivo, with METH1 
and/or METH2 polynucleotide contained in an adenovirus vector. Adenovirus can be 
manipulated such that it encodes and expresses METH1 and/or METH2, and at the same 
time is inactivated in terms of its ability to replicate in a normal lytic viral life cycle. 
Adenovirus expression is achieved without integration of the viral DNA into the host cell 
chromosome, thereby alleviating concerns about insertional mutagenesis. Furthermore, 
adenoviruses have been used as live enteric vaccines for many years with an excellent safety 
profile (Schwartz, A. R. et al. (1974) Am. Rev. Respir. Dis. 109:233-238). Finally, 
adenovirus-mediated gene transfer has been demonstrated in a number of instances including 
transfer of alpha-1-antitrypsin and CFTR to the lungs of cotton rats (Rosenfeld, M. A. et al. 
(1991) Science 252:431-434; Rosenfeld et al. (1992) Cell 68:143-155). Furthermore, 
extensive studies to attempt to establish adenovirus as a causative agent in human cancer 
were uniformly negative (Green, M. et al. (1979) Proc. Natl. Acad. Sci. USA 76:6606). 

Suitable adenoviral vectors useful in the present invention are described, for 
example, in Kozarsky and Wilson, Curr. Opin. Genet. Devel. 3:499-503 (1993); Rosenfeld 
et al., Cell 68:143-155 (1992); Engelhardt et al., Human Genet. Ther. 4:759-769 (1993); 
Yang et al., Nature Genet. 7:362-369 (1994); Wilson et al., Nature 365:691-692 (1993); and 
U.S. Patent No. 5,652,224, which are herein incorporated by reference. For example, the 
adenovirus vector Ad2 is useful and can be grown in human 293 cells. These cells contain 
the E1 region of adenovirus and constitutively express E1a and E1b, which complement the 
defective adenoviruses by providing the products of the genes deleted from the vector. In 
addition to Ad2, other varieties of adenovirus (e.g., Ad3, Ad5, and Ad7) are also useful in 
the present invention. 

Preferably, the adenoviruses used in the present invention are replication deficient. 
Replication deficient adenoviruses require the aid of a helper virus and/or packaging cell line 
to form infectious particles. The resulting virus is capable of infecting cells and can express 
a polynucleotide of interest which is operably linked to a promoter, but cannot replicate in 
most cells. Replication deficient adenoviruses may be deleted in one or more of all or a 
portion of the following genes: E1a, E1b, E3, E4, E2a, or L1 through L5. 

In certain other embodiments, the cells are engineered, ex vivo or in vivo, using an 
adeno-associated virus (AAV). AAVs are naturally occurring defective viruses that require 
helper viruses to produce infectious particles (Muzyczka, N., Curr. Topics in Microbiol. 
Immunol. 158:91 (1992)). AAV is also one of the few viruses that may integrate its DNA into 
non-dividing cells. Vectors containing as little as 300 base pairs of AAV can be packaged 
and can integrate, but space for exogenous DNA is limited to about 4.5 kb. Methods for 
producing and using such AAVs are known in the art. See, for example, U.S. Patent Nos. 
5,139,941, 5,173,414, 5,354,678, 5,436,146, 5,474,935, 5,478,745, and 5,589,377. 

For example, an appropriate AAV vector for use in the present invention will include 
all the sequences necessary for DNA replication, encapsidation, and host-cell integration. 
The METH1 and/or METH2 polynucleotide construct is inserted into the AAV vector using 
standard cloning methods, such as those found in Sambrook et al., Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Press (1989). The recombinant AAV vector is then 
transfected into packaging cells which are infected with a helper virus, using any standard 
technique, including lipofection, electroporation, calcium phosphate precipitation, etc. 

Appropriate helper viruses include adenoviruses, cytomegaloviruses, vaccinia viruses, or 
herpes viruses. Once the packaging cells are transfected and infected, they will produce 
infectious AAV viral particles which contain the METH1 and/or METH2 polynucleotide 
construct. These viral particles are then used to transduce eukaryotic cells, either ex vivo 
or in vivo. The transduced cells will contain the METH1 and/or METH2 polynucleotide 
construct integrated into their genome, and will express METH1 and/or METH2. 

Another method of gene therapy involves operably associating heterologous control 
regions and endogenous polynucleotide sequences (e.g., encoding METH1 and/or METH2) 
via homologous recombination (see, e.g., U.S. Patent No. 5,641,670, issued June 24, 1997; 
International Publication No. WO 96/29411, published September 26, 1996; International 
Publication No. WO 94/12650, published August 4, 1994; Koller et al., Proc. Natl. Acad. 
Sci. USA 86:8932-8935 (1989); and Zijlstra et al., Nature 342:435-438 (1989)). This method 
involves the activation of a gene which is present in the target cells, but which is not 
normally expressed in the cells, or is expressed at a lower level than desired. 

Polynucleotide constructs are made, using standard techniques known in the art, 
which contain the promoter with targeting sequences flanking the promoter. Suitable 
promoters are described herein. The targeting sequence is sufficiently complementary to an 
endogenous sequence to permit homologous recombination of the promoter-targeting 
sequence with the endogenous sequence. The targeting sequence will be sufficiently near 
the 5' end of the desired endogenous METH1 and/or METH2 polynucleotide sequence so 
the promoter will be operably linked to the endogenous sequence upon homologous 
recombination. 

The promoter and the targeting sequences can be amplified using PCR. Preferably, 
the amplified promoter contains distinct restriction enzyme sites on the 5' and 3' ends. 
Preferably, the 3' end of the first targeting sequence contains the same restriction enzyme site 
as the 5' end of the amplified promoter and the 5' end of the second targeting sequence 
contains the same restriction site as the 3' end of the amplified promoter. The amplified 
promoter and targeting sequences are digested and ligated together. 
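
The arrangement of restriction sites described above can be checked with a short sketch. The Python below is a hypothetical illustration and not a procedure taken from this specification: the `Fragment` type, the function name, and the enzyme names in the usage lines are assumptions made for the example. It confirms that the promoter carries distinct sites at its two ends and that each targeting sequence carries the matching site at the junction, which is what allows the three fragments to ligate in a single order. 

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Fragment:
    name: str
    five_prime_site: str   # restriction site at the 5' end
    three_prime_site: str  # restriction site at the 3' end


def check_assembly(first_target, promoter, second_target):
    """Return a list of design problems; an empty list means the design is consistent."""
    problems = []
    if promoter.five_prime_site == promoter.three_prime_site:
        problems.append("promoter ends must carry distinct sites")
    if first_target.three_prime_site != promoter.five_prime_site:
        problems.append("first targeting sequence 3' site must match promoter 5' site")
    if second_target.five_prime_site != promoter.three_prime_site:
        problems.append("second targeting sequence 5' site must match promoter 3' site")
    return problems


# Hypothetical design; the enzyme choices are placeholders for illustration only.
promoter = Fragment("promoter", "HindIII", "BamHI")
upstream = Fragment("targeting sequence 1", "XbaI", "HindIII")
downstream = Fragment("targeting sequence 2", "BamHI", "EcoRI")
print(check_assembly(upstream, promoter, downstream))
```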

The promoter-targeting sequence construct is delivered to the cells, either as naked 
polynucleotide, or in conjunction with transfection-facilitating agents, such as liposomes, 
viral sequences, viral particles, whole viruses, lipofection, precipitating agents, etc., 
described in more detail above. The promoter-targeting sequence can be delivered by any 
method, including direct needle injection, intravenous injection, topical administration, 
catheter infusion, particle accelerators, etc. The methods are described in more detail below. 

The promoter-targeting sequence construct is taken up by cells. Homologous 
recombination between the construct and the endogenous sequence takes place, such that an 
endogenous METH1 and/or METH2 sequence is placed under the control of the promoter. 
The promoter then drives the expression of the endogenous METH1 and/or METH2 
sequence. 

The polynucleotides encoding METH1 and/or METH2 may be administered along 
with other polynucleotides encoding other proteins. Such proteins include, but are not 
limited to, acidic and basic fibroblast growth factors, VEGF-1, VEGF-2, VEGF-3, VEGF-E, 
PIGF 1 and 2, epidermal growth factor alpha and beta, platelet-derived endothelial cell 
growth factor, platelet-derived growth factor alpha and beta, tumor necrosis factor alpha, 
hepatocyte growth factor, insulin like growth factor, colony stimulating factor, macrophage 
colony stimulating factor, granulocyte/macrophage colony stimulating factor, and nitric 
oxide synthase. 

Preferably, the polynucleotide encoding METH1 and/or METH2 contains a secretory 
signal sequence that facilitates secretion of the protein. Typically, the signal sequence is 
positioned in the coding region of the polynucleotide to be expressed towards or at the 5' end 
of the coding region. The signal sequence may be homologous or heterologous to the 
polynucleotide of interest and may be homologous or heterologous to the cells to be 
transfected. Additionally, the signal sequence may be chemically synthesized using methods 
known in the art. 

Any mode of administration of any of the above-described polynucleotide 
constructs can be used so long as the mode results in the expression of one or more 
molecules in an amount sufficient to provide a therapeutic effect. This includes direct 
needle injection, systemic injection, catheter infusion, biolistic injectors, particle accelerators 
(i.e., "gene guns"), gelfoam sponge depots, other commercially available depot materials, 
osmotic pumps (e.g., Alza minipumps), oral or suppositorial solid (tablet or pill) 
pharmaceutical formulations, and decanting or topical applications during surgery. For 
example, direct injection of naked calcium phosphate-precipitated plasmid into rat liver and 
rat spleen or a protein-coated plasmid into the portal vein has resulted in expression 
of the foreign gene in the rat livers (Kaneda et al., Science 243:315 (1989)). 

A preferred method of local administration is by direct injection. Preferably, a 
recombinant molecule of the present invention complexed with a delivery vehicle is 
administered by direct injection into or locally within the area of arteries. Administration of 
a composition locally within the area of arteries refers to injecting the composition 
within centimeters, and preferably within millimeters, of the arteries. 

Another method of local administration is to apply a polynucleotide construct of 
the present invention in or around a surgical wound. For example, a patient can undergo 
surgery and the polynucleotide construct can be coated on the surface of tissue inside the 
wound or the construct can be injected into areas of tissue inside the wound. 

Therapeutic compositions useful in systemic administration include recombinant 
molecules of the present invention complexed to a targeted delivery vehicle of the present 
invention. Suitable delivery vehicles for use with systemic administration comprise 
liposomes comprising ligands for targeting the vehicle to a particular site. 

Preferred methods of systemic administration include intravenous injection, aerosol, 
oral and percutaneous (topical) delivery. Intravenous injections can be performed using 
methods standard in the art. Aerosol delivery can also be performed using methods standard 
in the art (see, for example, Stribling et al., Proc. Natl. Acad. Sci. USA 89:11277-11281 
(1992), which is incorporated herein by reference). Oral delivery can be performed by 
complexing a polynucleotide construct of the present invention to a carrier capable of 
withstanding degradation by digestive enzymes in the gut of an animal. Examples of such 
carriers include plastic capsules or tablets, such as those known in the art. Topical delivery 
can be performed by mixing a polynucleotide construct of the present invention with a 
lipophilic reagent (e.g., DMSO) that is capable of passing into the skin. 

Determining an effective amount of substance to be delivered can depend upon a 
number of factors including, for example, the chemical structure and biological activity of 
the substance, the age and weight of the animal, the precise condition requiring treatment 
and its severity, and the route of administration. The frequency of treatments depends upon 
a number of factors, such as the amount of polynucleotide constructs administered per dose, 
as well as the health and history of the subject. The precise amount, number of doses, and 
timing of doses will be determined by the attending physician or veterinarian. 

Therapeutic compositions of the present invention can be administered to any 
animal, preferably to mammals and birds. Preferred mammals include humans, dogs, cats, 
mice, rats, rabbits, sheep, cattle, horses and pigs, with humans being particularly preferred. 

Chromosome Assays 

The nucleic acid molecules of the present invention are also valuable for 
chromosome identification. The sequence is specifically targeted to and can hybridize with 
a particular location on an individual human chromosome. The mapping of DNAs to 
chromosomes according to the present invention is an important first step in correlating 
those sequences with genes associated with disease. 

In certain preferred embodiments in this regard, the cDNA herein disclosed is used 
to clone genomic DNA of a METH1 or METH2 protein gene. This can be accomplished 
using a variety of well known techniques and libraries, which generally are available 
commercially. The genomic DNA then is used for in situ chromosome mapping using well 
known techniques for this purpose. 

In addition, in some cases, sequences can be mapped to chromosomes by preparing 
PCR primers (preferably 15-25 bp) from the cDNA. Computer analysis of the 3' 
untranslated region of the gene is used to rapidly select primers that do not span more than 
one exon in the genomic DNA, since primers spanning more than one exon would 
complicate the amplification process. These primers are then used for PCR screening of 
somatic cell hybrids containing individual human chromosomes. 

Fluorescence in situ hybridization ("FISH") of a cDNA clone to a metaphase 
chromosomal spread can be used to provide a precise chromosomal location in one step. 
This technique can be used with probes from the cDNA as short as 50 or 60 bp. For a 
review of this technique, see Verma et al., Human Chromosomes: A Manual Of Basic 
Techniques, Pergamon Press, New York (1988). 

Other mapping strategies that can similarly be used to map a sequence to its chromosome 
include radiation hybrid mapping, prescreening with labeled flow-sorted chromosomes and 
preselection by hybridization to construct chromosome-specific cDNA libraries. Radiation 
hybrid (RH) mapping relies upon fragmentation of human chromosomes with X-rays, and 
retention of these random fragments in hamster A23 host cells. The DNAs for RH mapping 
are supplied by Research Genetics (USA). Oligo pairs are designed from EST sequences 
that will amplify products of between 80 bp and 300 bp. The PCRs are performed on 93 
human/hamster hybrid DNAs and the results compared with a framework map 
(http://www-genome.wi.mit.edu/cgi-bin/contig/rhmapper.pl; Gyapay et al., Human Molecular 
Genetics 5:339-346 (1996)). RH mapping provides greater precision than FISH and indicates 
clusters of genes as well as disease locus/gene correlations. 

Once a sequence has been mapped to a precise chromosomal location, the physical 
position of the sequence on the chromosome can be correlated with genetic map data. Such 
data are found, for example, in V. McKusick, Mendelian Inheritance In Man, available 
on-line through Johns Hopkins University, Welch Medical Library. The relationships 
between genes and diseases that have been mapped to the same chromosomal region are then 
identified through linkage analysis (coinheritance of physically adjacent genes). 

Next, it is necessary to determine the differences in the cDNA or genomic sequence 
between affected and unaffected individuals. If a mutation is observed in some or all of the 
affected individuals but not in any normal individuals, then the mutation is likely to be the 
causative agent of the disease. 
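
The comparison just described amounts to a set operation. The Python sketch below assumes each individual's sequence differences have already been reduced to a set of variant labels; the labels in the usage note and the helper name are hypothetical. It reports variants seen in at least one affected individual and in no unaffected individual, which identifies candidates only and does not by itself establish causation. 

```python
def candidate_variants(affected, unaffected):
    """Variants found in any affected individual but in no unaffected individual.

    affected, unaffected: lists of sets of variant labels, one set per person.
    """
    seen_in_affected = set().union(*affected)
    seen_in_unaffected = set().union(*unaffected)
    return seen_in_affected - seen_in_unaffected
```

For instance, with affected individuals carrying `{"v1", "v2"}` and `{"v1"}` and an unaffected individual carrying `{"v2"}`, only `"v1"` remains as a candidate. 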

The METH1 gene maps between STS markers D21S1435 and D21S1442, which 
corresponds to 21q21. This is a similar chromosomal location to amyloid precursor protein 
(APP). APP and METH1 are approximately 3 million bases apart, which is not a large 
distance in human genomics. The chromosomal location includes important genes such as 
enterokinases (enzymes that activate trypsinogen by converting it to trypsin) and genes 
responsible for Alzheimer's disease. 

The METH1 gene can be mapped to 21q21 using the following oligos for radiation 
hybrid mapping: 

5' primer: ACTGTGTGTGATCCGAG 
3' primer: GTTGGAAAGCATTGACG 
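
For orientation, basic properties of the two oligos listed above can be computed directly. The Python sketch below reports length, GC content, and a melting temperature estimated by the Wallace rule (2 °C per A or T, 4 °C per G or C). The Wallace rule is a rough approximation chosen for this illustration and is not a method stated in this specification. 

```python
PRIMERS = {
    "5' primer": "ACTGTGTGTGATCCGAG",
    "3' primer": "GTTGGAAAGCATTGACG",
}


def primer_stats(seq):
    """Length, GC fraction, and Wallace-rule Tm estimate (degrees C) for an oligo."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    return {
        "length": len(seq),
        "gc_fraction": gc / len(seq),
        "wallace_tm_c": 2 * at + 4 * gc,
    }


for name, seq in PRIMERS.items():
    print(name, primer_stats(seq))
```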

Having generally described the invention, the same will be more readily understood 
by reference to the following examples, which are provided by way of illustration and are 
not intended as limiting. 

Examples 

Example 1: Identification and Cloning of METH1 and METH2 

To search for novel genes with TSP-like domains, a large human cDNA database 
consisting of approximately 900,000 expressed sequence tags (ESTs) was screened for 
sequences homologous to the second type I repeat of TSP1. Several ESTs were predicted 
to encode proteins with TSP-like domains. Two cDNA clones originating from human heart 
and lung libraries were further sequenced and chosen for functional analysis. 

The amino-terminal end of METH1 was obtained using the 5' rapid amplification of 
cDNA ends (RACE) PCR technique (Marathon cDNA amplification kit, Clontech) 
according to the manufacturer's instructions. The amino-terminal end of METH2 was obtained 
partially through 5' RACE PCR and later confirmed and completed by genomic screening. 
For the genomic screen, BAC clones (Genome Systems) were initially identified by PCR. 
Positive BAC clones containing 150-200bp of sequence were subsequently subcloned into 
pGEM vector as small fragments and sequenced. 

Analysis and comparison of the deduced amino acid sequence with the GenBank, 
EMBL and SwissProt databases suggested that these genes belong to a new family of 
metalloproteases with homology to the reprolysin family in their NH2-terminal end and with 
several TSP-like motifs in the COOH-terminal end. These cDNAs were named METH1 and 
METH2; ME, for metalloprotease and TH, for thrombospondin. The mouse homologue of 
METH1 was identified and named ADAMTS1 (Kuno, K., et al., J. Biol. Chem. 272:556-562 
(1997)). Direct comparison of the human and mouse sequences revealed a high level of 
conservation (83.4% amino acid identity). Thus far no homologues for METH2 have been 
identified. 
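
Percent identity figures such as the one quoted above are computed from a pairwise alignment. The following Python sketch shows one common convention (identical positions divided by aligned positions, ignoring columns with a gap in either sequence). The convention actually used for the 83.4% figure is not stated in the text, so this is an assumption, and the function name is hypothetical. 

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over columns where neither aligned sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # skip gapped columns
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * identical / compared
```

Other conventions divide by the full alignment length or by the length of the shorter sequence, and these give different values for the same alignment. 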

Interestingly, a recently identified protein named pNPI (procollagen I N-proteinase; 
Colidge, A., et al., Proc. Natl. Acad. Sci. USA 94:2374-2379 (1997)) showed a striking 
sequence and structural similarity to METH1 and METH2 (Figure 3). Like the novel proteins 
described here, pNPI also contains metalloproteinase (reprolysin subfamily) and TSP 
domains at the carboxy-terminal end. Although the sequence for pNPI is of bovine origin, 
sequence alignment revealed identical structural features. The amino acid similarity 
between METH1 and METH2 is 51.7%, and between METH1 or METH2 and pNPI the 
homology is lower: 33.9% and 36.3%, respectively. 

Sequence analysis showed that the ORFs of METH1 and METH2 coded for proteins 
of 950 and 890 amino acids, respectively. In all three proteins, the NH2-terminal end 
contains a putative signal peptide followed by another putative transmembrane domain 
around amino acid 300, deduced from the hydrophilicity plots. It is not clear whether these 
proteins are bound to the membrane. However, given preliminary data, it is more likely that 
this second transmembrane domain will consist of a hydrophobic pocket and that METH1, 
METH2 and pNPI are in fact secreted proteins. The NH2-terminal end past the signal 
peptide has homology to the superfamily of zinc metalloproteases and can be subdivided into 
a prodomain, a metalloprotease domain, and a cysteine-rich region. 
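
Hydrophobic stretches of the kind discussed above are usually located with a sliding-window profile. The Python sketch below uses the Kyte-Doolittle hydropathy scale with a 19-residue window, a common choice for spotting candidate membrane-spanning segments. The text does not say which scale or window was used, so both are assumptions, as is the 1.6 threshold; the function names are hypothetical. 

```python
# Kyte-Doolittle hydropathy values (one-letter amino acid codes).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein, window=19):
    """Mean hydropathy of each full-length window along the sequence."""
    scores = [KYTE_DOOLITTLE[residue] for residue in protein.upper()]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


def hydrophobic_windows(protein, window=19, threshold=1.6):
    """0-based start positions of windows whose mean hydropathy exceeds threshold."""
    profile = hydropathy_profile(protein, window)
    return [i for i, value in enumerate(profile) if value > threshold]
```

A window that scores above the threshold is only a candidate; as the text notes, such a segment may form a buried hydrophobic pocket rather than a membrane anchor. 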

The double underlined sequences in METH1 (amino acids 232-235) and METH2 
(amino acids 211-214) in Figure 3, localized at the boundary between the prodomain and the 
metalloprotease domain, are potential cleavage sites for mammalian subtilisins, such as 
furins (Barr, 1991). Proteolytic processing occurs in SVMPs to yield soluble 
metalloproteases and disintegrins (Bjarnason, J.B. & Fox, J., Methods Enzymol. 248:345-368 
(1995)) and has also been detected in some ADAMs (reviewed by Wolsberg, T.G. & 
White, J.M., Developmental Biology 180:389-401 (1996)). Proteolytic processing occurs 
in both METH1 and METH2 (see below). Additionally, both METH1 and METH2 present a 
Zn²⁺-binding site (dotted line in Figure 3) that is presumed to be catalytically active due to the 
conservation of certain functionally important amino acids (Rawlings, N.D. & Barrett, A.J., 
Methods Enzymol. 248:183-228 (1995)), suggesting that these proteins may be active 
proteases. 

Following the metalloprotease domain, there is a cysteine-rich region which contains 
two putative disintegrin loops (Wolsberg, T.G. & White, J.M., Developmental Biology 
180:389-401 (1996)) (marked by arrows in Figure 3). Disintegrin domains are found within 
the superfamily of metalloproteases in snake venom metalloproteases (SVMPs) and ADAMs 
(mammalian proteins containing a disintegrin and a metalloprotease domain) and have a 
possible function inhibiting binding of integrins to their ligands in SVMPs. Conversely, the 
ADAM-disintegrin-like domain, as part of membrane-anchored proteins, may promote, rather 
than disrupt, cell-cell interactions (Wolsberg, T.G. & White, J.M., Developmental Biology 
180:389-401 (1996)). The TSP-like domains are located in the COOH-half of METH1 and 
METH2 proteins. METH1 contains two conserved TSP domains separated by a spacer 
region with unknown function, and a subdomain with less homology, and only 5 cysteines, 
following the second anti-angiogenic region. METH2 contains two TSP domains separated 
by the spacer region. The alignment of the TSP-like domains of METH1 and METH2 with 
those of TSP1 and TSP2 is shown in Figure 5. The homology varies from 19.2% to 
52% amino acid similarity among all the TSP repeats. The cysteines, numbered 1 to 6, and 
the tryptophans, labeled by asterisks, are highly conserved. 

Southern blot of human genomic DNA revealed the presence of METH1 and 
METH2 in the genome. METH1 and METH2 probes revealed bands of different size, 
suggesting that they are transcribed from different genes. 

The consensus sequence for the type I repeats includes 16 residues with 6 perfectly 
conserved cysteines. Typically it begins with the sequence motif WSXWS (SEQ ID NO: 82) 
that has also been shown to bind to heparin (Guo, N., et al., J. Biol. Chem. 267:19349-19355 
(1992)). The affinity of this region to heparin has been proposed to be part of the 
anti-angiogenic activity of TSP-1 (Guo, N., et al., J. Peptide Res. 49 (1997)). Among the five 
members of the TSP family of proteins, only TSP-1 and TSP-2 inhibit angiogenesis and 
contain the type I repeats (Tolsma, S.S., et al., J. Cell Biol. 122:497-511 (1993); Kyriakides, 
T.R., et al., J. Cell Biol. 140:419-430 (1998)). The type I or properdin repeats were 
probably added to the precursor of TSP1 and TSP2 by exon shuffling between 500 and 900 
million years ago (Adams, J., et al., The Thrombospondin Gene Family, 1 Ed. Molecular 
Biology Intelligence Unit (Springer, Ed.), R.G. Landes Company, Germany (1995)). It is 
likely that the acquisition of this domain provided the precursor of TSP1 and TSP2 with 
functions, such as regulation of new vessel formation. More recently, BAI-1 (brain 
angiogenic inhibitor-1), a protein isolated from a brain library for its ability to be regulated 
by p53, has also been shown to contain the type I repeat of TSP-1 and to provide 
anti-angiogenic potential to this molecule (Nishimori, H., et al., Oncogene 15:2145-2150 
(1997)). Nevertheless, it appears that additional sequences or context are also important, 
since other proteins containing the type I repeats, such as properdin, F-spondin, and other 
members of the complement family, appear not to have clear or established 
anti-angiogenic properties. 

Because of the presence of TSP-repeats in METH1 and METH2, along with their 
anti-angiogenic properties, these proteins were originally considered members of the TSP 
superfamily. Nevertheless, they have no additional homology to other TSPs, and in fact, the 
similarity to TSP1 and TSP2 is restricted to the type I repeats. Furthermore, the proteins 
also have strong sequence and structural homology to members of the ADAM family. These 
features led Kuno and colleagues to give the name ADAMTS to the mouse homolog of METH1 
(Kuno, K., et al., J. Biol. Chem. 272:556-562 (1997)). The recent identification of pNPI and 
its striking sequence homology to the proteins described here prompt the grouping of all 
three proteins in a subfamily named metallospondins. At this point, it is not clear whether 
pNPI has anti-angiogenic properties or whether METH1 and/or METH2 participate in the 
cleavage of the amino-terminal pro-peptide of α1(I) procollagen. 

Example 2: Northern and Southern Blot Analysis 

Total RNA was purified from cells by guanidinium-isothiocyanate extraction, as 
previously described (Chomczynski, P. & Sacchi, N., Anal. Biochem. 162:156-159 (1987)). 
Poly(A)+ RNA was extracted using a Boehringer Mannheim (BMB, Indianapolis, IN) kit 
according to the manufacturer's conditions. Other poly(A)+ RNA blots were purchased from 
Clontech (Palo Alto, CA). Pre-hybridization was performed in a solution containing 50% 
formamide, 6X SSPE, 1X Denhardt's solution, 0.1% SDS and 100 µg/ml of heat-denatured 
salmon sperm DNA for 12-18 h at 42°C. Hybridization with labeled cDNA probes 
proceeded in the same solution at 42°C for 12-18 h. TSP1 and METH1 probes corresponded 
to the entire human cDNAs. The METH2 probe corresponded to a KpnI-EcoRI fragment 
from the human cDNA. A 1.3 kb PstI fragment of the 
glyceraldehyde-3-phosphate-dehydrogenase (GPDH) was used to normalize for loading and 
transfer efficiency. 
Membranes were exposed to Kodak Biomax MS film (Kodak, New Haven, CT). 

For Southern blots, human genomic DNA, purchased from Promega (Madison, WI),
was heated at 65°C for 10 min and digested with EcoRI and PstI overnight at 37°C. 5 µg
of digested DNA was separated in a 1% agarose gel, transferred to a Nytran membrane and
cross-linked by ultraviolet light. The cDNA probes, as well as the prehybridization and
hybridization conditions, were identical to those described for Northern blots. Blots were
washed at high stringency (0.2X SSC, 0.2% SDS at 50°C).

The expression patterns of METH1 and METH2 were examined in both adult and
embryonic tissues. Northern blot analysis was performed under high-stringency conditions
with blots that included poly(A)+ RNA from human tissues. METH1 and METH2
transcripts appeared as single bands of 4.6 and 3.7Kb, respectively. Abundant METH1 mRNA
expression was observed in adrenal, heart and placenta, followed by skeletal muscle, thyroid
and stomach. Of the embryonic tissues analyzed, kidney showed the highest expression of
METH1 mRNA. Nevertheless, weaker expression of METH1 mRNA was seen in all tissues
analyzed. Distribution of METH2 mRNA was more restricted and weaker than that of
METH1. The highest expression was seen in lung, both embryonic and adult. Interestingly,
METH1 and METH2 expression do not appear to overlap. In combination, the structural
similarities and the patterns of expression suggest functional redundancy yet different
transcriptional regulation. The expression levels of TSP1 transcripts in the same blots were
also analyzed for purposes of comparison. The highest TSP1 mRNA expression was seen in
the adult placenta and in all embryonic tissues analyzed. In contrast to METH1 and METH2,
we observed constant levels of TSP1 transcript in all the other tissues examined.

The cell type distribution was also studied by Northern blot analysis of
poly(A)+ RNA. METH1 mRNA was detectable, at low levels, in dermal fibroblasts,
vascular smooth muscle, endometrial stromal cells, and in two cancer cell lines, HeLa and
G631, an adenocarcinoma and a melanoma, respectively. METH2 mRNA was detected only
in SW480, a colon carcinoma cell line; no expression was seen in any other of the cell
lines or primary strains analyzed.

The possibility that groups of angiogenic and anti-angiogenic factors regulate
vascular network formation in specific organs is a frequently discussed hypothesis that is
likely to be true, yet unproven. The expression patterns of METH1 and METH2, which are
clearly distinct and almost non-overlapping, were puzzling, at least with regard to overall
levels. TSP1 and TSP2 also share an identical structure and a high level of amino acid
similarity, yet their patterns of expression differ significantly (Iruela-Arispe, M.L.,
Dev. Dyn. 197:40-56 (1993)). The differences are likely based on dissimilar cis-acting
elements in their promoters and different regulatory mechanisms, as previously suggested.
Although the promoters for METH1 and METH2 have not been characterized, it is likely that
they provide unique features for the regulation of each gene. Nevertheless, the possibility
that one motif, the anti-angiogenic / type I repeat, with demonstrated anti-angiogenic
properties, is present in several proteins with different tissue specificities is appealing.
Alternatively, the small differences in sequence between closely related members of the
same family could possess significance that goes beyond functional redundancy. In the case
of TSP1 and TSP2, aside from the striking structural similarities and perhaps functionally
common anti-angiogenic properties, each protein also appears to display functions of its own
that are not likely shared by its relative. This became evident with the outcome of the two
knock-outs for these genes. TSP1 null animals exhibited primarily lung disorders (Lawler,
J., et al., J. Clin. Invest. 101:982-992 (1998)) and secondarily vascular abnormalities, but
only under specific pathological settings or in a restricted set of organs. In contrast, TSP2
knock-out mice exhibited unpredicted collagen assembly anomalies, with consequences for
the skin, tendons, and bone (Kyriakides, T.R., et al., J. Cell Biol. 140:419-430 (1998)).
In addition, these animals also appear to have an overall increase in capillary
density in the dermis. It is not understood how the resemblance between the newly described
members of the metallospondin family translates functionally. pNIP has clearly been shown
to display proteolytic activity by cleaving the N-terminus of type I procollagen
(Colige, A., et al., Proc. Natl. Acad. Sci. USA 94:2374-2379 (1997)).

A second region of functional interest corresponds to the disintegrin domain. This
domain has been more fully characterized in related members of the snake venom
metalloproteases that have been shown to bind to αIIbβ3 and inhibit platelet interaction,
blocking coagulation (Pfaff, M., et al., Cell Adhes. Commun. 2:491-501 (1994); Usami, Y.,
et al., Biochem. Biophys. Res. Commun. 201:331-339 (1994)). The disintegrin motif
consists of a thirteen- to fifteen-residue domain which frequently contains an RGD or a
negatively charged residue at the position of the aspartic acid. The RGD, or equivalent,
binds to integrins and serves as an antagonist or signaling ligand (Wolfsberg, T.G. & White,
J.M., Developmental Biology 180:389-401 (1996)). METH2, but not METH1, has an RGD
sequence located amino-terminal to the disintegrin domain. In addition, both molecules
present a relatively high, but not perfect, degree of conservation of cysteines within the
disintegrin motif. This conservation appears to play an important role in the tertiary
structure of this region and its ability to interact with integrins. In addition, some of
these domains have been shown to act as functional adhesion molecules, particularly those
with transmembrane regions (Wolfsberg, T.G. & White, J.M., Developmental Biology
180:389-401 (1996)). It is unlikely that this will be the case for METH1 and METH2, since
both these proteins appear to be secreted.

Example 3: Expression and Purification of Recombinant Proteins

Recombinant constructs for expression of His-tagged fusion proteins were generated
for expression in bacteria. METH1 nt 605-1839 (from ATG) was amplified by polymerase
chain reaction using primers containing BamHI and PstI sites and subcloned into the pRSET
vector (Invitrogen, Carlsbad, CA). The construct was sequenced to verify frame and
sequence fidelity and was then transformed into the BL21:DE3 E. coli strain (Stratagene
Cloning Systems, La Jolla, CA). Purification was performed by affinity chromatography on
Ni-NTA columns. Recombinant protein was eluted with 500mM imidazole in PBS.
Fractions containing recombinant protein were dialyzed against phenol-red free DMEM and
used to generate antisera.

Antisera were generated by intramuscular injection of a 1:1 mixture of recombinant
protein (500 µg/ml) and Freund's adjuvant. Eight animals, comprising five guinea pigs and
three rabbits, were injected every 15 days for three cycles. After the third injection, serum
was evaluated for the presence of anti-METH1 antibodies; only two of the guinea pigs showed
significant titers. The antibodies recognized recombinant protein on Western blots, were
able to immunoprecipitate METH1 protein from cell extracts and recognized the protein by
immunocytochemistry. Pre-immune serum was always included as a control. One of the guinea
pig antibodies was also able to recognize METH2.

For mammalian expression, full-length METH1 and METH2 cDNAs were cloned
into the pcDNA3.1 expression vector (Invitrogen). Expression from this vector is under the
regulatory control of the CMV promoter. Cloning was performed so that constructs contained
their own termination codons.

Recombinant protein was obtained by transient transfection of the expression vectors
into 293T cells using standard calcium phosphate precipitation. Upon transfection, cells were
incubated for 6 to 16h in serum-containing media and then switched to serum-free media for
36h for accumulation of recombinant protein. As a control, pcDNA3.1 vector alone was
transiently transfected in parallel plates. Purification of the protein included 30%
ammonium sulfate precipitation followed by dialysis against HS buffer (DB = 10mM HEPES,
150mM NaCl, 1mM CaCl2 and 1mM MgSO4). Samples were then subjected to heparin-affinity
chromatography. Elution from heparin columns was achieved with HS buffer
containing 550mM NaCl. Fractions were then loaded on 5-30% sucrose gradients and spun
at 48K. Separation on sucrose gradients was assessed by Western blotting and purity was
determined by Coomassie blue and silver nitrate staining.

Generation of recombinant protein was initially done in bacteria. A METH1
expression vector was generated containing an amino-terminal His tag to aid in
purification. The resulting construct coded for all of the METH1 translated sequence except
the prodomain. Affinity chromatography on Ni2+ beads showed a unique band of 68kD.
Isolation and purification were always performed under denaturing conditions, and attempts
to refold the protein met with little success, probably due to a significant number of
intramolecular disulfide bonds associated with the large number of cysteines. Nonetheless,
the protein was used to generate antibodies. Of eight animals injected, only two were
able to mount an immune response and generate specific antibodies, possibly due to the high
conservation across species. Both antibodies recognized recombinant METH1 protein
before and after purification on Ni2+ columns. The antibody was also used to evaluate
expression of the protein on Western blots of cell lysates. A single band of approximately
105-110 kD was detected in stromal fibroblasts and smooth muscle cells.

To test the hypothesis that METH1 and METH2 could function as regulators of
angiogenesis, recombinant full-length protein was generated in mammalian cells. Correct
reading frame and molecular weight were initially tested by in vitro translation.
Translation of the METH1 open reading frame revealed a 110kD protein, slightly larger
than the size predicted by translation of the cDNA sequence. As previously indicated, there
are two putative glycosylation sites, so the larger size of the protein is likely due to the
addition of sugar residues. Similarly, METH2 was also slightly larger than its predicted
size, showing a 98kD protein.

Recombinant proteins were isolated from 293T supernatants under native conditions
to preserve secondary structure. From analysis of the deduced amino acid sequence and
published information on the murine homolog, ADAMTS, it was predicted that both
proteins could bind to heparin, and heparin-affinity chromatography was therefore used for
purification. Both the cell layer and conditioned media of 293T cells transfected with
METH1, METH2 and vector control were used for purification. The molecular weights of
METH1 and METH2 were similar to those obtained from the reticulocyte lysate. As predicted,
both proteins are secreted. Interestingly, the media contained both the full-length form
(110kD) and two processed forms, of 85 and 67kD for METH1, and of 79 and 64kD for METH2.
The 85 and 79kD molecular weights agree with the
predicted size for both proteins after cleavage at the consensus subtilisin site. However, a
second processing event must take place to generate the most abundant fragments, observed
at 67 and 64kD, respectively. These forms are stable after purification even in the absence
of proteinase inhibitors. For purification, proteins were initially concentrated by ammonium
sulfate precipitation, followed by dialysis. The resulting protein suspension was then
applied to heparin-sepharose columns. Recombinant METH1 and METH2 were eluted
with washing buffer containing 550mM NaCl. Fractions contained both pro-METH1 and
the processed forms. Because it was unclear whether processing was relevant for
function of the proteins, the forms were separated on sucrose gradients. Both full-length
and processed forms were used in angiogenesis assays.

Recombinant constructs for expression of truncated fusion proteins were as follows:
(1) pRSET-METH1-Type I: METH1 nt 1605-1839 (from the start codon) was amplified by
polymerase chain reaction using the following primers: 5'-GCA TTT TGG ATC CGC CTT
TTC ATG-3' (SEQ ID NO:78) and 5'-GTT GTG TGC TGC AGA TTG TTC C-3' (SEQ
ID NO:79). The amplified fragment was then subcloned into the BamHI and PstI sites of
the pRSET vector; (2) pGEX-METH1-TSP was generated by ligating the BamHI-EcoRI
fragment from pRSET-METH1-TSP into the SmaI site of the pGEX-5X vector
(Pharmacia Biotech Inc., Piscataway, NJ) by blunt-end ligation; (3) pGEX-1.0-METH2: the
fragment nt 838-1818 of METH2 cDNA (from the start codon) was ligated into the
BamHI-EcoRI sites of pGEM-2TK. The METH2 fragment was amplified by PCR using the
following primers: 5'-GAAAAATGGGGATCCGAGGTG-3' (SEQ ID NO:80) and
5'-GCAGGAGAATTCCGTCCATG-3' (SEQ ID NO:81) to generate BamHI and EcoRI
restriction sites; (4) pGEX-METH2-TSP: a 0.5Kb XmaI-EcoRI fragment isolated from
pGEX-1.0-METH2 was subcloned into the XmaI and EcoRI sites of the pGEX-2TK vector. All
constructs were sequenced to verify sequence fidelity and correct open reading frame.

The recombinant proteins were named as follows: 6H-METH1, the protein expressed
from the plasmid pRSET-METH1-TSP; GST-METH1, the protein expressed from the plasmid
pGEX-METH1-TSP; and GST-METH2, the protein expressed from the plasmid
pGEX-METH2-TSP.

Expression plasmids were transformed into the BL21:DE3 E. coli strain (Stratagene
Cloning Systems, La Jolla, CA) and fusion proteins were induced following the manufacturer's
recommendations. Briefly, induced bacterial pellets were resuspended in PBS and sonicated
on ice for 1 min. The suspension was subsequently incubated at RT for 20min in the
presence of 1% Triton X-100 and centrifuged at 4°C. Histidine-tagged fusion proteins were
then purified on Ni-NTA beads (Qiagen, Chatsworth, CA) by incubating 20ml of
supernatant with 1ml of beads (50% slurry) for 2h at 4°C. The suspension was transferred
into a column and washed with 10 column volumes of PBS containing 10mM imidazole,
followed by 50mM imidazole and finally 100mM imidazole. The protein was eluted with
500mM imidazole in PBS. Fractions containing the recombinant protein were dialyzed
against phenol-red free DMEM. Samples were centrifuged for 30min at 4°C; part of the
protein was not soluble and was lost during centrifugation. The supernatant was stored at
-70°C and used for proliferation, cornea pocket and chorioallantoic membrane (CAM)
assays.

For purification of GST-fusion proteins, the extract was cleared by centrifugation and
applied to a GST-affinity column (Pharmacia). The column was washed with PBS-1% Triton
X-100 in the presence of 0.1mM reduced glutathione and, subsequently, with the same
buffer in the presence of 0.5mM reduced glutathione. Fusion proteins were eluted with
10mM reduced glutathione in 50mM Tris-HCl, pH 7.5. Fractions containing the protein
were dialyzed against DMEM, stored at -70°C and used for proliferation, cornea pocket and
chorioallantoic membrane (CAM) assays.

Integrity and purity of the recombinant proteins were analyzed in 12.5% or 15%
acrylamide gels stained with Coomassie blue.

A recombinant GST fusion protein containing the first two type I repeats of TSP was
also dialyzed against DMEM before use in functional assays. Intact TSP1 was purified
from platelets as previously described (Roberts, D.D., et al., J. Tissue Cult. Methods
16:217-222 (1994)).

To test the hypothesis that the METH1 and METH2 TSP domains could function as
regulators of angiogenesis, recombinant fusion proteins were generated in bacteria. The
constructs included the first TSP domain of METH1 or METH2. This domain is the most
conserved, with 52% amino acid similarity to the second type I repeat of TSP1 (this domain
contains a putative binding site for CD36). All recombinant proteins were isolated under
native conditions to preserve their secondary structure as much as possible. 6H-METH1 and
GST-METH1 contained the first TSP-like domain of METH1 fused to a histidine tag or to
GST, respectively. METH1 recombinant protein was made with two different tags because
of purification and structural advantages. The differences in size are due to the size of the
tag: 6kDa for the histidine tag and 27kDa for GST. GST-METH2 contained the first TSP domain
of METH2, also fused to GST. A fragment corresponding to the last two type I repeats of
TSP1, also fused to GST, and intact TSP1 purified from platelets were used as positive
controls. In addition, GST alone was included in all experiments as a negative control.

Example 4: TSP domains in METH1 and METH2 disrupt angiogenesis in vivo

Cornea pocket assay 

Swiss Webster females and males were purchased from Charles River (Boston, MA)
and used at 8-10 weeks of age for implantation of the pellets. Cornea pockets were
performed as described by Kenyon and colleagues (Kenyon, B.M., et al., Invest.
Ophthalmol. Vis. Sci. 37:1625-1632 (1996)) with a few modifications. Briefly, a solution of
10 µg of recombinant bFGF plus 5 mg of sucralfate was mixed with 10 µl of Hydron
(200mg/ml in ethanol; New Brunswick, NJ) and the recombinant protein of interest (2 µg).
The suspension was then smeared onto a sterile nylon mesh square (pore size 500 µm; Tetko
Inc., Briarcliff Manor, NY) and allowed to dry for 30min. The fibers of the mesh were
pulled to produce pellets of 500 µm² that were stored at -20°C. Uniformly sized pellets were
selected under a microscope and used for the assays.

Mice were anesthetized with Avertin. Under a Nikon SMZ-U dissecting microscope, an
incision was made in the cornea with a surgical blade. A single pellet was
implanted into the pocket. Five days after pellet implantation, corneal angiogenesis was
evaluated and photographed.



CAM assay 

Chorioallantoic membrane assays were performed on Leghorn chicken embryos
(SPAFAS, MA) at 12-14 days of embryonic development. Matrigel (750 µg/ml), VEGF
(250ng/mesh) and the protein or peptide to be tested were mixed, placed onto nylon meshes
(pore size 250 µm; Tetko Inc.) and incubated sequentially at 37°C for 30min and at 4°C for
2h to induce polymerization. A positive (matrigel and VEGF) and a negative (VEGF alone)
control were also prepared for each CAM. Polymerized meshes were placed onto the third
outer region of the CAM and incubated for 24h. To visualize vessels, 400 µl of fluorescein
isothiocyanate dextran (10mg/ml, SIGMA) was injected into the chick blood stream. After
5-10min incubation, the chick was topically fixed with 3.7% formaldehyde for 5min. The
meshes were then dissected and mounted onto slides. Fluorescence intensity was analyzed
with a computer-assisted image program (NIH Image 1.59).

Peptides used in these assays were synthesized by Chiron (Raleigh, NC). Sequences
corresponded to amino acids: P-TSP1, 430-447; P-METH1, 549-563; P-METH2, 529-548.

The evaluation of angiogenic or anti-angiogenic responses relies heavily on the
sensitivity and specificity of the assays used to assess the response. To evaluate the
anti-angiogenic activity of these fragments in vivo, two popular and well-accepted
angiogenesis assays were used: the corneal pocket and the chorioallantoic membrane. The
visibility, accessibility, and avascularity of the cornea are highly advantageous and
facilitate the visualization of the neovascular response and the topical application of the
test substances. A known amount of angiogenesis factor(s) is implanted, as a pellet, in a
pocket made in the cornea. To test an angiogenesis inhibitor, the molecule is implanted with
the stimulator in the same pellet, and the response is compared to that of the stimulator
alone.

In these experiments, bFGF was used as the vascularization stimulator. Pellets
containing the recombinant protein were implanted in mouse corneas and their ability to
inhibit the bFGF-induced angiogenic response was compared to that of controls. When a
bFGF pellet containing GST was implanted, new capillary vessels grew from the cornea
limbus, across the cornea and into the pellet within 5 days. In contrast, addition of
GST-METH1 or GST-METH2 to the bFGF pellets completely abolished blood vessel growth.
Table 4 contains a summary of the results obtained from the 41 assays performed. Intact TSP1
purified from platelets and GST-TSP1 were used as positive controls. All assays were
performed at identical concentrations, suggesting that METH1 and METH2 have similar
potency to that of TSP1 in the inhibition of angiogenesis. In addition, when half of the
standard concentration was used, a weak but noticeable response was seen, indicating
a dose-dependent effect.

Table 4.

Activity of METH1 and METH2 recombinant proteins in the corneal pocket assay

bFGF Pellets          Vascularized corneas/total corneas
Vehicle               5/5
TSP1                  0/5
GST                   11/11
GST-TSP1-TI           1/4
GST-METH1-TSP         0/8
GST-METH2-TSP         0/8
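
As a quick arithmetic check on Table 4, the following Python sketch tabulates the fraction of vascularized corneas for each pellet type and confirms that the counts add up to the 41 assays mentioned in the text. The counts are copied from the table, on the assumption that each entry reads as vascularized corneas over total corneas; the names TABLE_4, vascularized_fraction and total_assays are illustrative only and are not part of the original disclosure.

```python
# Counts from Table 4, read as (vascularized corneas, total corneas).
# The dictionary and function names are hypothetical, chosen for illustration.
TABLE_4 = {
    "Vehicle": (5, 5),
    "TSP1": (0, 5),
    "GST": (11, 11),
    "GST-TSP1-TI": (1, 4),
    "GST-METH1-TSP": (0, 8),
    "GST-METH2-TSP": (0, 8),
}


def vascularized_fraction(counts):
    """Return the fraction of vascularized corneas for each treatment."""
    return {name: vasc / total for name, (vasc, total) in counts.items()}


def total_assays(counts):
    """Return the total number of corneas assayed across all treatments."""
    return sum(total for _, total in counts.values())


if __name__ == "__main__":
    for name, fraction in vascularized_fraction(TABLE_4).items():
        print(f"{name:15s} {fraction:.2f}")
    print("total assays:", total_assays(TABLE_4))
```

Under this reading, the negative controls (vehicle and GST) are fully vascularized, while the METH1 and METH2 fusion proteins show no vascularized corneas.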



In the CAM assay, the angiogenic response is analyzed by measuring the number of
vessels that grow within a matrix polymer containing the angiogenic growth factor. To
determine whether recombinant METH1 and METH2 proteins inhibited VEGF-induced
neovascularization in the CAM assay, a matrigel polymer containing VEGF and the
recombinant protein was implanted in the CAM. Quantitative analysis of the experiments,
which included three different polymers per treatment, is shown in Figure 6A. Matrigel
polymers containing VEGF plus GST-METH1 or GST-METH2 caused greater than
80% inhibition of blood vessel growth. A similar potency was found using the GST
recombinant protein derived from the type I repeats of TSP1. Furthermore, the
anti-angiogenic effect of the TSP domains in METH1 and METH2 was dose-dependent, with
complete inhibition of blood vessel growth when 15 µg/ml of protein was used (Figure 6C
and D). GST alone, at identical concentrations, had no significant effect on
VEGF-stimulated angiogenesis. CAM assays performed with the unprocessed forms of METH1
and METH2 provided similar results to the processed forms. It was unclear whether
processing is not required for function or whether the CAM tissue led to processing of the
proteins. Thus, the intact protein was incubated with CAM tissue for 24h and then evaluated
on Western blots. The results demonstrate that the CAM tissue was able to
generate a 68kD METH1 processed protein.

Synthetic peptides from the second or the third type I repeats of human TSP1 can
mimic the anti-angiogenic effects of intact TSP1 (Tolsma, S.S., et al., J. Cell Biol.
122:497-511 (1993)). In fact, a 19-residue polypeptide was shown to be sufficient to block
in vivo neovascularization in the rat cornea and to inhibit the bFGF-induced migration of
cultured endothelial cells (Vogel, T., et al., J. Cell. Biochem. 53:74-84 (1993); Tolsma, S.S.,
et al., J. Cell Biol. 122:497-511 (1993)). To test whether the same was true for the METH1
and METH2 TSP domains, peptides derived from the same region were synthesized and
their anti-angiogenic activity was evaluated in the CAM assay. The results are shown in
Figure 6B. Peptides derived from the TSP domains of both METH1 and METH2 blocked
VEGF-induced angiogenesis to an extent similar to that of TSP1. In contrast, scrambled
peptides had no significant effect.



Example 5: Proliferation Assays 



Human dermal endothelial cells (HDEC) were isolated and grown on Vitrogen™
coated petri dishes in EBM (Clonetics, San Diego, CA) supplemented with 15% fetal calf
serum, 25 µg/ml cAMP, and 1 µg/ml of hydrocortisone-21-acetate and were used from
passages 3 to 6. Cells were made quiescent by incubation of confluent monolayers with
phenol red-free EBM containing 0.2% BSA for 48h. Human dermal fibroblasts were
isolated from neonatal foreskin by enzymatic dissociation. Both fibroblasts and smooth
muscle cells were maintained in DMEM supplemented with 10% fetal calf serum. Human
mammary epithelial cells (HMEC) were purchased from Clonetics and maintained in the
recommended media (mammary epithelial growth media, MEGM).

Quiescent human dermal endothelial cells, between passages 3 and 6, were plated on
Vitrogen™ coated 24-well plates in EBM supplemented with 0.2% BSA, 0.1% fetal calf
serum and 1 ng/ml of bFGF in the presence or absence of the recombinant protein and
incubated in 5% CO2 at 37°C for 48h. For vascular smooth muscle (VSM) and fibroblast
proliferation assays, cells were incubated under the same conditions but using DMEM
instead of EBM. Human mammary epithelial cells were incubated in their growth media.
A pulse of [3H]-thymidine (1 µCi/µl) was added during the last 4h prior to harvesting. Cells
were washed and fixed in 10% TCA. Incorporation of [3H]-thymidine was determined by
scintillation counting, as previously described (Iruela-Arispe, M.L. & Sage, E.H., J. Cell.
Biochem. 52:414 (1993)).

Statistical analyses were done using InStat software (GraphPad Software) for
Macintosh. Assuming normal distributions, data were analyzed by one-way ANOVA,
followed by a t-test or Dunnett test for comparisons between groups, or by the
Student-Newman-Keuls test for multiple comparisons between groups.
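
To make the first step of this analysis concrete, the sketch below computes the one-way ANOVA F statistic from first principles in plain Python. It is an illustration only: the original analysis was run in commercial software, the function name one_way_anova_f is an assumption, and the numbers in the usage example are invented placeholders rather than data from these experiments. The post hoc tests named above are not reproduced here.

```python
from statistics import mean


def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a one-way ANOVA.

    `groups` is a list of lists, one list of measurements per treatment.
    """
    all_values = [value for group in groups for value in group]
    grand_mean = mean(all_values)
    group_means = [mean(group) for group in groups]

    # Variation of the group means around the grand mean.
    ss_between = sum(
        len(group) * (group_mean - grand_mean) ** 2
        for group, group_mean in zip(groups, group_means)
    )
    # Variation of individual values around their own group mean.
    ss_within = sum(
        (value - group_mean) ** 2
        for group, group_mean in zip(groups, group_means)
        for value in group
    )

    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within


if __name__ == "__main__":
    # Hypothetical placeholder values, not measurements from this study.
    example = [[1.0, 2.0, 3.0], [2.0, 3.0, 4.0], [5.0, 6.0, 7.0]]
    print(one_way_anova_f(example))
```

The F statistic is then compared against the F distribution with the returned degrees of freedom; a significant result is what justifies the follow-up pairwise comparisons.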

To gain insight into the mechanism by which METH1 and METH2 inhibit
neovascularization, the direct effect of the purified recombinant fusion proteins on
endothelial cell proliferation was tested. Serum-starved endothelial cells were plated into
growth medium containing bFGF and FCS. Recombinant proteins (3 µg/ml) were added at
the time of plating. 40% (GST-METH1), 45% (6H-GST) or 36% (GST-METH2)
inhibition was observed, in contrast to a non-significant effect when GST alone was added.
The recombinant protein from the type I repeats of TSP1 had similar inhibitory effects
(Figure 7A). Furthermore, suppression of proliferation mediated by METH1 or METH2 was
dose-dependent, as shown in Figure 7E. The inhibition was observed as early as one day
after treatment, and the inhibitory effect was nontoxic and reversible, since the removal of
the recombinant protein and subsequent addition of growth factor alone led to the resumption
of endothelial cell proliferation.
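
Percent inhibition values of this kind are conventionally derived from thymidine incorporation counts as the relative drop from the untreated control. The text does not state the exact formula used, so the Python sketch below is an assumption about the calculation, with a hypothetical function name and made-up counts in the usage example.

```python
def percent_inhibition(control_cpm, treated_cpm):
    """Percent reduction in incorporated counts relative to the control.

    Assumes both values are background-corrected counts per minute.
    A negative result indicates stimulation rather than inhibition.
    """
    if control_cpm <= 0:
        raise ValueError("control counts must be positive")
    return 100.0 * (control_cpm - treated_cpm) / control_cpm


if __name__ == "__main__":
    # Hypothetical counts for illustration, not data from this study.
    print(percent_inhibition(1000.0, 600.0))
```

With this definition, the small stimulation of a non-target cell type would appear as a negative percentage.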

The cell specificity of the anti-proliferative effects of METH1 and METH2 on the
endothelium was evaluated by additional proliferation assays on a variety of non-endothelial
cells. No significant inhibition of proliferation was seen in fibroblast or smooth muscle
cell cultures. On the contrary, a non-significant but reproducible stimulation of
proliferation could be observed for these two cell types. This result rules out the presence
of any potential nonspecific inhibitor of cell growth in the recombinant protein preparations.
In mammary epithelial cells, however, METH1 and METH2 inhibited cell proliferation to the
same degree as in endothelial cells. Interestingly, TSP1 also suppresses mammary epithelial
cell proliferation both in vitro and in a transgenic model.

The possibility that METH1 and METH2 might act as disintegrins is consistent with
their anti-angiogenic properties. Blockade of αvβ3 and β1 integrins with antibodies
has clearly been shown to inhibit neovascularization both during development and in tumors
(Brooks, P.C., et al., Cell 85:683-693 (1996); Brooks, P.C., et al., Cell 92:391-400 (1998);
Senger, D.R., et al., Proc. Natl. Acad. Sci. USA 94:13612-13617 (1997)). Integrins are
essential for the mediation of both proliferative and migratory signals (Schwartz, M.A. &
Ingber, D.E., Mol. Biol. Cell 5:389-393 (1994)); therefore, interference with those signals
can be highly deleterious to the angiogenic process. The angiogenic functional assays were
performed with recombinant protein containing only the type I repeats in METH1 and
METH2.

The mechanism of action of METH1 and METH2 with regard to their angio-inhibitory
activity is not known. To date we have evidence that these proteins are secreted
and bind to endothelial cells. Further investigations are directed towards the identification
of receptors and signal transduction mechanisms. A likely hypothesis resulting from the
lessons learned from TSP1 is that both METH1 and METH2 bind to CD36. Recently, this
scavenger receptor has been implicated in the mediation of signals by which TSP-1 exerts its
anti-angiogenic effects (Dawson, D.W., et al., J. Cell Biol. 138:707-717 (1997)). Both the
CSVTCG (SEQ ID NO:83) (Asch, A.S., et al., Nature 262:1436-1439 (1993); Catimel, B.,
et al., Biochem. J. 284:231-236 (1992)) and the GCQXR (SEQ ID NO:84) sequences have
been proposed as primary binding motifs to CD36 (Dawson, D.W., et al., J. Cell Biol.
138:707-717 (1997)). METH1 and METH2 show almost complete conservation in both these
regions. A complementary and also likely occurrence is binding of METH1 and METH2
to bFGF. Binding to heparin and bFGF has been proposed as part of the anti-angiogenic
activity of TSP1 (Guo, N., et al., J. Peptide Res. 49 (1997)). This property appears to be
mediated through the WSXWS (SEQ ID NO:82) motif, also conserved in METH1 and
METH2. Future efforts will focus on the signals implicated in the anti-angiogenic properties
mediated by these novel proteins and on their potential as proteases of the extracellular
milieu.

Example 6: Isolation of the METH1 or METH2 cDNA Clone From the Deposited Sample

Two approaches can be used to isolate METH1 or METH2 from the deposited
sample. First, the deposited clone is transformed into a suitable host (such as XL-1 Blue
(Stratagene)) using techniques known to those of skill in the art, such as those provided by
the vector supplier or in related publications or patents. The transformants are plated on
1.5% agar plates (containing the appropriate selection agent, e.g., ampicillin) to a density of
about 150 transformants (colonies) per plate. A single colony is then used to generate DNA
using nucleic acid isolation techniques well known to those skilled in the art (e.g.,
Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Edit., (1989), Cold Spring
Harbor Laboratory Press).
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Alternatively, two primers of 17-20 nucleotides derived from both ends of the SEQ 
ID NO:l or SEQ ID NO:3 (i.e., within the region of SEQ ID NO:l or SEQ ID NO:3 
bounded by the 5' NT and the 3' NT of the clone) are synthesized and used to amplify the 
METHl or METH2 cDNA using the deposited cDNA plasmids as templates. The 
5 polymerase chain reaction is carried out under routine conditions, for instance, in 25 jil of 

reaction mixture with 0.5 jiig of the above cDNA template. A convenient reaction mixture 
is 1.5-5 mM MgCls, 0.01% (w/v) gelatin, 20 each of dATP, dCTP, dGTP, dTTP, 25 
pmol of each primer and 0.25 Unit of Taq polymerase. Thirty five cycles of PGR 
(denaturation at for 1 min; annealing at 55 ""C for 1 min; elongation at 72 ""C for 1 min) 

10 are performed with a Perkin-Elmer Cetus automated thermal cycler. The amplified product 

is analyzed by agarose gel electrophoresis and the DNA band with expected molecular 
weight is excised and purified. The PGR product is verified to be the selected sequence by 
subcloning and sequencing the DNA product. 
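
The choice of end primers described above can be illustrated with a short Python sketch.
The function names and the toy template are hypothetical and serve only as an example; in
practice the template would be SEQ ID NO:1 or SEQ ID NO:3, and candidate primers would
normally also be checked for melting temperature and secondary structure.

```python
from typing import Tuple

# Translation table for complementing DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def end_primers(template: str, length: int = 20) -> Tuple[str, str]:
    """Pick one primer from each end of the template.

    The forward primer is the first `length` bases of the sense strand.
    The reverse primer is the reverse complement of the last `length` bases,
    so that both primers are written 5' to 3'.
    """
    if not 17 <= length <= 20:
        raise ValueError("primer length should be 17-20 nucleotides")
    if len(template) < 2 * length:
        raise ValueError("template is too short for two non-overlapping primers")
    forward = template[:length].upper()
    reverse = reverse_complement(template[-length:])
    return forward, reverse
```

For example, `end_primers(template, 18)` returns an 18-nucleotide forward primer and an
18-nucleotide reverse primer for any template of at least 36 bases.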

Several methods are available for the identification of the 5' or 3' non-coding
portions of the METH1 or METH2 gene which may not be present in the deposited clones.
These methods include, but are not limited to, filter probing, clone enrichment using specific
probes, and protocols similar or identical to 5' and 3' "RACE" protocols which are well
known in the art. For instance, a method similar to 5' RACE is available for generating the
missing 5' end of a desired full-length transcript. (Fromont-Racine et al., Nucleic Acids Res.
21(7):1683-1684 (1993).)

Briefly, a specific RNA oligonucleotide is ligated to the 5' ends of a population of
RNA presumably containing full-length gene RNA transcripts. A primer set containing a
primer specific to the ligated RNA oligonucleotide and a primer specific to a known
sequence of the METH1 or METH2 gene of interest is used to PCR amplify the 5' portion
of the METH1 or METH2 full-length gene. This amplified product may then be sequenced
and used to generate the full length gene.

The above method starts with total RNA isolated from the desired source, although
poly-A+ RNA can be used. The RNA preparation can then be treated with phosphatase if
necessary to eliminate 5' phosphate groups on degraded or damaged RNA which may
interfere with the later RNA ligase step. The phosphatase should then be inactivated and the
RNA treated with tobacco acid pyrophosphatase in order to remove the cap structure present
at the 5' ends of messenger RNAs. This reaction leaves a 5' phosphate group at the 5' end
of the cap cleaved RNA which can then be ligated to an RNA oligonucleotide using T4
RNA ligase.

This modified RNA preparation is used as a template for first strand cDNA synthesis
using a gene specific oligonucleotide. The first strand synthesis reaction is used as a
template for PCR amplification of the desired 5' end using a primer specific to the ligated
RNA oligonucleotide and a primer specific to the known sequence of the gene of interest.
The resultant product is then sequenced and analyzed to confirm that the 5' end sequence
belongs to the METH1 or METH2 gene.



Example 7: Bacterial Expression of METH1 or METH2



A METH1 or METH2 polynucleotide encoding a METH1 or METH2 polypeptide of the
invention is amplified using PCR oligonucleotide primers corresponding to the 5' and 3' ends
of the DNA sequence, as outlined in Example 5, to synthesize insertion fragments. The
primers used to amplify the cDNA insert should preferably contain restriction sites, such as
BamHI and XbaI, at the 5' end of the primers in order to clone the amplified product into the
expression vector. For example, BamHI and XbaI correspond to the restriction enzyme sites
on the bacterial expression vector pQE-9 (Qiagen, Inc., Chatsworth, CA). This plasmid
vector encodes antibiotic resistance (Amp^r), a bacterial origin of replication (ori), an
IPTG-regulatable promoter/operator (P/O), a ribosome binding site (RBS), a 6-histidine tag
(6-His), and restriction enzyme cloning sites. The pQE-9 vector is digested with BamHI and
XbaI and the amplified fragment is ligated into the pQE-9 vector maintaining the reading
frame initiated at the bacterial RBS. The ligation mixture is then used to transform the E.
coli strain M15/rep4 (Qiagen, Inc.) which contains multiple copies of the plasmid pREP4,
which expresses the lacI repressor and also confers kanamycin resistance (Kan^r).
Transformants are identified by their ability to grow on LB plates and ampicillin/kanamycin
resistant colonies are selected. Plasmid DNA is isolated and confirmed by restriction
analysis.

Clones containing the desired constructs are grown overnight (O/N) in liquid culture
in LB media supplemented with both Amp (100 µg/ml) and Kan (25 µg/ml). The O/N
culture is used to inoculate a large culture at a ratio of 1:100 to 1:250. The cells are grown
to an optical density 600 (O.D.600) of between 0.4 and 0.6. IPTG
(isopropyl-β-D-thiogalactopyranoside) is then added to a final concentration of 1 mM. IPTG
induces by inactivating the lacI repressor, clearing the P/O, leading to increased gene
expression.

Cells are grown for an extra 3 to 4 hours. Cells are then harvested by centrifugation (20
mins at 6000 x g). The cell pellet is solubilized in the chaotropic agent 6 Molar Guanidine
HCl by stirring for 3-4 hours at 4°C. The cell debris is removed by centrifugation, and the
supernatant containing the polypeptide is loaded onto a nickel-nitrilo-tri-acetic acid
("Ni-NTA") affinity resin column (available from QIAGEN, Inc., supra). Proteins with a 6 x His
tag bind to the Ni-NTA resin with high affinity and can be purified in a simple one-step
procedure (for details see: The QIAexpressionist (1995) QIAGEN, Inc., supra). Briefly, the
supernatant is loaded onto the column in 6 M guanidine-HCl, pH 8, the column is first
washed with 10 volumes of 6 M guanidine-HCl, pH 8, then washed with 10 volumes of 6
M guanidine-HCl, pH 6, and finally the polypeptide is eluted with 6 M guanidine-HCl, pH
5.
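
The inoculation and induction volumes in the protocol above follow from simple dilution
arithmetic, sketched below in Python. The function names are hypothetical, and the 1 M
(1000 mM) IPTG stock concentration is an assumption made for illustration; the text
specifies only the 1:100 to 1:250 inoculation ratio, the O.D.600 window and the 1 mM final
IPTG concentration.

```python
def inoculum_volume_ml(culture_ml: float, dilution: float) -> float:
    """Volume of overnight culture needed for a 1:dilution inoculation."""
    if not 100 <= dilution <= 250:
        raise ValueError("the protocol uses ratios between 1:100 and 1:250")
    return culture_ml / dilution


def iptg_stock_volume_ml(culture_ml: float,
                         final_mm: float = 1.0,
                         stock_mm: float = 1000.0) -> float:
    """Volume of IPTG stock giving `final_mm` in the culture (C1*V1 = C2*V2).

    stock_mm = 1000.0 (a 1 M stock) is an assumed value, not taken from the text.
    The small volume added is neglected in the final culture volume.
    """
    return culture_ml * final_mm / stock_mm


def ready_to_induce(od600: float) -> bool:
    """True when the culture density lies in the 0.4-0.6 induction window."""
    return 0.4 <= od600 <= 0.6
```

Under these assumptions, a 1 liter culture would be inoculated with 4 to 10 ml of overnight
culture and induced with 1 ml of the assumed 1 M IPTG stock.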

The purified METH1 or METH2 protein is then renatured by dialyzing it against
phosphate-buffered saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM NaCl.
Alternatively, the METH1 or METH2 protein can be successfully refolded while
immobilized on the Ni-NTA column. The recommended conditions are as follows: renature
using a linear 6M-1M urea gradient in 500 mM NaCl, 20% glycerol, 20 mM Tris/HCl pH
7.4, containing protease inhibitors. The renaturation should be performed over a period of
1.5 hours or more. After renaturation the proteins are eluted by the addition of 250 mM
imidazole. Imidazole is removed by a final dialyzing step against PBS or 50 mM
sodium acetate pH 6 buffer plus 200 mM NaCl. The purified METH1 or METH2 protein
is stored at 4°C or frozen at -80 °C.

In addition to the above expression vector, the present invention further includes an
expression vector comprising phage operator and promoter elements operatively linked to
a METH1 or METH2 polynucleotide, called pHE4a. (ATCC Accession Number 209645,
deposited February 25, 1998.) This vector contains: 1) a neomycin phosphotransferase gene
as a selection marker, 2) an E. coli origin of replication, 3) a T5 phage promoter sequence,
4) two lac operator sequences, 5) a Shine-Dalgarno sequence, and 6) the lactose operon
repressor gene (lacIq). The origin of replication (oriC) is derived from pUC19 (LTI,
Gaithersburg, MD). The promoter sequence and operator sequences are made synthetically.

DNA can be inserted into the pHE4a by restricting the vector with NdeI and XbaI,
BamHI, XhoI, or Asp718, running the restricted product on a gel, and isolating the larger
fragment (the stuffer fragment should be about 310 base pairs). The DNA insert is generated
according to the PCR protocol described in Example 5, using PCR primers having restriction
sites for NdeI (5' primer) and XbaI, BamHI, XhoI, or Asp718 (3' primer). The PCR insert
is gel purified and restricted with compatible enzymes. The insert and vector are ligated
according to standard protocols.

The engineered vector could easily be substituted in the above protocol to express 
protein in a bacterial system. 

Example 8: Purification of METH1 or METH2 Polypeptide from an Inclusion Body

The following alternative method can be used to purify METH1 or METH2
polypeptide expressed in E. coli when it is present in the form of inclusion bodies. Unless
otherwise specified, all of the following steps are conducted at 4-10 °C.

Upon completion of the production phase of the E. coli fermentation, the cell culture
is cooled to 4-10 °C and the cells harvested by continuous centrifugation at 15,000 rpm
(Heraeus Sepatech). On the basis of the expected yield of protein per unit weight of cell
paste and the amount of purified protein required, an appropriate amount of cell paste, by
weight, is suspended in a buffer solution containing 100 mM Tris, 50 mM EDTA, pH 7.4.
The cells are dispersed to a homogeneous suspension using a high shear mixer.

The cells are then lysed by passing the solution through a microfluidizer
(Microfluidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate is then
mixed with NaCl solution to a final concentration of 0.5 M NaCl, followed by centrifugation
at 7000 x g for 15 min. The resultant pellet is washed again using 0.5 M NaCl, 100 mM Tris,
50 mM EDTA, pH 7.4.

The resulting washed inclusion bodies are solubilized with 1.5 M guanidine
hydrochloride (GuHCl) for 2-4 hours. After 7000 x g centrifugation for 15 min., the pellet
is discarded and the polypeptide containing supernatant is incubated at 4°C overnight to
allow further GuHCl extraction.

Following high speed centrifugation (30,000 x g) to remove insoluble particles, the
GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 20 volumes
of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM EDTA by vigorous
stirring. The refolded diluted protein solution is kept at 4°C without mixing for 12 hours
prior to further purification steps.
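
The effect of the 20-volume dilution on the denaturant can be estimated with the following
Python sketch. It assumes, for illustration only, that the extract still contains GuHCl at
the 1.5 M used for solubilization and that volumes are additive; the function name is
hypothetical.

```python
def diluted_concentration(stock_conc: float,
                          sample_volumes: float = 1.0,
                          diluent_volumes: float = 20.0) -> float:
    """Concentration after mixing a sample with diluent (volumes assumed additive)."""
    if sample_volumes <= 0 or diluent_volumes < 0:
        raise ValueError("volumes must be positive")
    return stock_conc * sample_volumes / (sample_volumes + diluent_volumes)


# Assumed starting point: 1.5 M GuHCl in the extract, mixed with 20 volumes of buffer.
residual_guhcl_molar = diluted_concentration(1.5)
```

Under these assumptions the residual GuHCl concentration is 1.5/21 M, i.e. roughly 0.07 M,
during the refolding incubation.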

To clarify the refolded polypeptide solution, a previously prepared tangential
filtration unit equipped with 0.16 µm membrane filter with appropriate surface area (e.g.,
Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is employed. The filtered sample
is loaded onto a cation exchange resin (e.g., Poros HS-50, Perseptive Biosystems). The
column is washed with 40 mM sodium acetate, pH 6.0 and eluted with 250 mM, 500 mM,
1000 mM, and 1500 mM NaCl in the same buffer, in a stepwise manner. The absorbance
at 280 nm of the effluent is continuously monitored. Fractions are collected and further
analyzed by SDS-PAGE.

Fractions containing the METH1 or METH2 polypeptide are then pooled and mixed
with 4 volumes of water. The diluted sample is then loaded onto a previously prepared set
of tandem columns of strong anion (Poros HQ-50, Perseptive Biosystems) and weak anion
(Poros CM-20, Perseptive Biosystems) exchange resins. The columns are equilibrated with
40 mM sodium acetate, pH 6.0. Both columns are washed with 40 mM sodium acetate, pH
6.0, 200 mM NaCl. The CM-20 column is then eluted using a 10 column volume linear
gradient ranging from 0.2 M NaCl, 50 mM sodium acetate, pH 6.0 to 1.0 M NaCl, 50 mM
sodium acetate, pH 6.5. Fractions are collected under constant A280 monitoring of the
effluent. Fractions containing the polypeptide (determined, for instance, by 16% SDS-PAGE)
are then pooled.
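
The NaCl concentration at any point in the 10 column volume linear gradient described
above can be computed by linear interpolation, as in this Python sketch. The function
name is hypothetical, and the calculation ignores system dead volume and mixing delay.

```python
def nacl_molar_at(column_volumes: float,
                  start_molar: float = 0.2,
                  end_molar: float = 1.0,
                  total_cv: float = 10.0) -> float:
    """NaCl concentration (M) after `column_volumes` of a linear gradient."""
    if not 0 <= column_volumes <= total_cv:
        raise ValueError("position must lie within the gradient")
    fraction = column_volumes / total_cv
    return start_molar + (end_molar - start_molar) * fraction
```

For instance, the midpoint of the gradient (5 column volumes) corresponds to about 0.6 M
NaCl, which can help relate a fraction number to the salt concentration at which the
polypeptide elutes.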

The resultant METH1 or METH2 polypeptide should exhibit greater than 95% purity
after the above refolding and purification steps. No major contaminant bands should be
observed from Coomassie blue stained 16% SDS-PAGE gel when 5 µg of purified protein
is loaded. The purified METH1 or METH2 protein can also be tested for endotoxin/LPS
contamination, and typically the LPS content is less than 0.1 ng/ml according to LAL assays.



Example 9: Cloning and Expression of METH1 or METH2 in a Baculovirus Expression
System

In this example, the plasmid shuttle vector pA2 is used to insert METH1 or METH2
polynucleotide into a baculovirus to express METH1 or METH2. This expression vector
contains the strong polyhedrin promoter of the Autographa californica nuclear polyhedrosis
virus (AcMNPV) followed by convenient restriction sites such as BamHI, XbaI and
Asp718. The polyadenylation site of the simian virus 40 ("SV40") is used for efficient
polyadenylation. For easy selection of recombinant virus, the plasmid contains the
beta-galactosidase gene from E. coli under control of a weak Drosophila promoter in the same
orientation, followed by the polyadenylation signal of the polyhedrin gene. The inserted
genes are flanked on both sides by viral sequences for cell-mediated homologous
recombination with wild-type viral DNA to generate a viable virus that expresses the cloned
METH1 or METH2 polynucleotide.

Many other baculovirus vectors can be used in place of the vector above, such as
pAc373, pVL941, and pAcIM1, as one skilled in the art would readily appreciate, as long
as the construct provides appropriately located signals for transcription, translation, secretion
and the like, including a signal peptide and an in-frame AUG as required. Such vectors are
described, for instance, in Luckow et al., Virology 170:31-39 (1989).

Specifically, the METH1 or METH2 cDNA sequence contained in the deposited
clone, including the AUG initiation codon and any naturally associated leader sequence, is
amplified using the PCR protocol described in Example 5. If the naturally occurring signal
sequence is used to produce the secreted protein, the pA2 vector does not need a second
signal peptide. Alternatively, the vector can be modified (pA2 GP) to include a baculovirus
leader sequence, using the standard methods described in Summers et al., "A Manual of
Methods for Baculovirus Vectors and Insect Cell Culture Procedures," Texas Agricultural
Experimental Station Bulletin No. 1555 (1987).

The amplified fragment is isolated from a 1% agarose gel using a commercially
available kit ("Geneclean," BIO 101 Inc., La Jolla, Ca.). The fragment then is digested with
appropriate restriction enzymes and again purified on a 1% agarose gel.

The plasmid is digested with the corresponding restriction enzymes and, optionally,
can be dephosphorylated using calf intestinal phosphatase, using routine procedures known
in the art. The DNA is then isolated from a 1% agarose gel using a commercially available
kit ("Geneclean," BIO 101 Inc., La Jolla, Ca.).

The fragment and the dephosphorylated plasmid are ligated together with T4 DNA
ligase. E. coli HB101 or other suitable E. coli hosts such as XL-1 Blue (Stratagene Cloning
Systems, La Jolla, CA) cells are transformed with the ligation mixture and spread on culture
plates. Bacteria containing the plasmid are identified by digesting DNA from individual
colonies and analyzing the digestion product by gel electrophoresis. The sequence of the
cloned fragment is confirmed by DNA sequencing.

Five µg of a plasmid containing the polynucleotide is co-transfected with 1.0 µg of
a commercially available linearized baculovirus DNA ("BaculoGold™ baculovirus DNA",
Pharmingen, San Diego, CA), using the lipofection method described by Felgner et al.,
Proc. Natl. Acad. Sci. USA 84:7413-7417 (1987). One µg of BaculoGold™ virus DNA and
5 µg of the plasmid are mixed in a sterile well of a microtiter plate containing 50 µl of
serum-free Grace's medium (Life Technologies Inc., Gaithersburg, MD). Afterwards, 10 µl
Lipofectin plus 90 µl Grace's medium are added, mixed and incubated for 15 minutes at
room temperature. Then the transfection mixture is added drop-wise to Sf9 insect cells
(ATCC CRL 1711) seeded in a 35 mm tissue culture plate with 1 ml Grace's medium
without serum. The plate is then incubated for 5 hours at 27 °C. The transfection solution
is then removed from the plate and 1 ml of Grace's insect medium supplemented with 10%
fetal calf serum is added. Cultivation is then continued at 27 °C for four days.

After four days the supernatant is collected and a plaque assay is performed, as
described by Summers and Smith, supra. An agarose gel with "Blue Gal" (Life
Technologies Inc., Gaithersburg) is used to allow easy identification and isolation of
gal-expressing clones, which produce blue-stained plaques. (A detailed description of a "plaque
assay" of this type can also be found in the user's guide for insect cell culture and
baculovirology distributed by Life Technologies Inc., Gaithersburg, page 9-10.) After
appropriate incubation, blue stained plaques are picked with the tip of a micropipettor (e.g.,
Eppendorf). The agar containing the recombinant viruses is then resuspended in a
microcentrifuge tube containing 200 µl of Grace's medium and the suspension containing
the recombinant baculovirus is used to infect Sf9 cells seeded in 35 mm dishes. Four days
later the supernatants of these culture dishes are harvested and then they are stored at 4°C.

To verify the expression of the polypeptide, Sf9 cells are grown in Grace's medium
supplemented with 10% heat-inactivated FBS. The cells are infected with the recombinant
baculovirus containing the polynucleotide at a multiplicity of infection ("MOI") of about 2.
If radiolabeled proteins are desired, 6 hours later the medium is removed and is replaced
with SF900 II medium minus methionine and cysteine (available from Life Technologies
Inc., Rockville, MD). After 42 hours, 5 µCi of 35S-methionine and 5 µCi of 35S-cysteine
(available from Amersham) are added. The cells are further incubated for 16 hours and then
are harvested by centrifugation. The proteins in the supernatant as well as the intracellular
proteins are analyzed by SDS-PAGE followed by autoradiography (if radiolabeled).
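
The volume of virus stock needed to reach the stated MOI follows directly from the
definition of MOI (plaque-forming units per cell). The Python sketch below is illustrative
only; the function name is hypothetical, and the cell count and titer in the usage note are
assumed example values (the titer would come from the plaque assay).

```python
def virus_inoculum_ml(cell_count: float,
                      titer_pfu_per_ml: float,
                      moi: float = 2.0) -> float:
    """Volume of virus stock (ml) that delivers `moi` pfu per cell."""
    if cell_count <= 0 or titer_pfu_per_ml <= 0 or moi <= 0:
        raise ValueError("cell count, titer and MOI must be positive")
    return moi * cell_count / titer_pfu_per_ml
```

For example, with an assumed 1 x 10^6 cells and an assumed stock titer of 1 x 10^8 pfu/ml,
an MOI of 2 corresponds to about 0.02 ml (20 µl) of virus stock.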

Microsequencing of the amino acid sequence of the amino terminus of purified
protein may be used to determine the amino terminal sequence of the produced METH1 or
METH2 protein.

Example 10: Expression of METH1 or METH2 in Mammalian Cells

METH1 or METH2 polypeptide can be expressed in a mammalian cell. A typical
mammalian expression vector contains a promoter element, which mediates the initiation
of transcription of mRNA, a protein coding sequence, and signals required for the
termination of transcription and polyadenylation of the transcript. Additional elements
include enhancers, Kozak sequences and intervening sequences flanked by donor and
acceptor sites for RNA splicing. Highly efficient transcription is achieved with the early and
late promoters from SV40, the long terminal repeats (LTRs) from Retroviruses, e.g., RSV,
HTLVI, HIVI and the early promoter of the cytomegalovirus (CMV). However, cellular
elements can also be used (e.g., the human actin promoter).

Suitable expression vectors for use in practicing the present invention include, for
example, vectors such as pSVL and pMSG (Pharmacia, Uppsala, Sweden), pRSVcat (ATCC
37152), pSV2DHFR (ATCC 37146), pBC12MI (ATCC 67109), pCMVSport 2.0, and
pCMVSport 3.0. Mammalian host cells that could be used include human Hela, 293, H9
and Jurkat cells, mouse NIH3T3 and C127 cells, Cos 1, Cos 7 and CV1, quail QC1-3 cells,
mouse L cells and Chinese hamster ovary (CHO) cells.

Alternatively, METH1 or METH2 polypeptide can be expressed in stable cell lines
containing the METH1 or METH2 polynucleotide integrated into a chromosome.
Co-transfection with a selectable marker such as DHFR, gpt, neomycin, or hygromycin allows the
identification and isolation of the transfected cells.

The transfected METH1 or METH2 gene can also be amplified to express large
amounts of the encoded protein. The DHFR (dihydrofolate reductase) marker is useful in
developing cell lines that carry several hundred or even several thousand copies of the gene
of interest. (See, e.g., Alt, F. W., et al., J. Biol. Chem. 253:1357-1370 (1978); Hamlin, J.
L. and Ma, C., Biochem. et Biophys. Acta 1097:107-143 (1990); Page, M. J. and Sydenham,
M. A., Biotechnology 9:64-68 (1991).) Another useful selection marker is the enzyme
glutamine synthase (GS) (Murphy et al., Biochem. J. 227:277-279 (1991); Bebbington et al.,
Bio/Technology 10:169-175 (1992)). Using these markers, the mammalian cells are grown
in selective medium and the cells with the highest resistance are selected. These cell lines
contain the amplified gene(s) integrated into a chromosome. Chinese hamster ovary (CHO)
and NSO cells are often used for the production of proteins.

Derivatives of the plasmid pSV2-DHFR (ATCC Accession No. 37146), the
expression vectors pC4 (ATCC Accession No. 209646) and pC6 (ATCC Accession
No. 209647) contain the strong promoter (LTR) of the Rous Sarcoma Virus (Cullen et al.,
Molecular and Cellular Biology, 438-447 (March, 1985)) plus a fragment of the
CMV-enhancer (Boshart et al., Cell 41:521-530 (1985)). Multiple cloning sites, e.g., with the
restriction enzyme cleavage sites BamHI, XbaI and Asp718, facilitate the cloning of METH1
or METH2. The vectors also contain the 3' intron, the polyadenylation and termination
signal of the rat preproinsulin gene, and the mouse DHFR gene under control of the SV40
early promoter.

If a naturally occurring signal sequence is used to produce a secreted protein, the
vector does not need a second signal peptide. Alternatively, if a naturally occurring signal
sequence is not used, the vector can be modified to include a heterologous signal sequence
in an effort to secrete the protein from the cell. (See, e.g., WO 96/34891.)

The amplified fragment is then digested with the appropriate restriction enzyme and
purified on a 1% agarose gel using a commercially available kit ("Geneclean," BIO 101 Inc.,
La Jolla, Ca.). The isolated fragment and the dephosphorylated vector are then ligated with
T4 DNA ligase. E. coli HB101 or XL-1 Blue cells are then transformed and bacteria are
identified that contain the fragment inserted into plasmid pC6 or pC4 using, for instance,
restriction enzyme analysis.

Chinese hamster ovary cells lacking an active DHFR gene are used for transfection.
Five µg of the expression plasmid pC6 or pC4 is cotransfected with 0.5 µg of the plasmid
pSVneo using lipofectin (Felgner et al., supra). The plasmid pSV2-neo contains a dominant
selectable marker, the neo gene from Tn5 encoding an enzyme that confers resistance to a
group of antibiotics including G418. The cells are seeded in alpha minus MEM
supplemented with 1 mg/ml G418. After 2 days, the cells are trypsinized and seeded in
hybridoma cloning plates (Greiner, Germany) in alpha minus MEM supplemented with 10,
25, or 50 ng/ml of methotrexate plus 1 mg/ml G418. After about 10-14 days single clones
are trypsinized and then seeded in 6-well petri dishes or 10 ml flasks using different
concentrations of methotrexate (50 nM, 100 nM, 200 nM, 400 nM, 800 nM). Clones
growing at the highest concentrations of methotrexate are then transferred to new 6-well
plates containing even higher concentrations of methotrexate (1 µM, 2 µM, 5 µM, 10 µM,
20 µM). The same procedure is repeated until clones are obtained which grow at a
concentration of 100-200 µM. Expression of METH1 or METH2 is analyzed, for instance,
by SDS-PAGE and Western blot or by reversed phase HPLC analysis.

Example 11: Construction of N-Terminal and/or C-Terminal Deletion Mutants

The following general approach may be used to clone an N-terminal or C-terminal
METH1 or METH2 deletion mutant. Generally, two oligonucleotide primers of
about 15-25 nucleotides are derived from the desired 5' and 3' positions of a polynucleotide
of SEQ ID NO:1 or SEQ ID NO:3. The 5' and 3' positions of the primers are determined
based on the desired METH1 or METH2 polynucleotide fragment. An initiation and stop
codon are added to the 5' and 3' primers respectively, if necessary, to express the METH1
or METH2 polypeptide fragment encoded by the polynucleotide fragment. Preferred
METH1 or METH2 polynucleotide fragments are those encoding the N-terminal and
C-terminal deletion mutants disclosed above in the "Polynucleotide and Polypeptide
Fragments" section of the Specification.

Additional nucleotides containing restriction sites to facilitate cloning of the METH1
or METH2 polynucleotide fragment in a desired vector may also be added to the 5' and 3'
primer sequences. The METH1 or METH2 polynucleotide fragment is amplified from
genomic DNA or from the deposited cDNA clone using the appropriate PCR
oligonucleotide primers and conditions discussed herein or known in the art. The METH1
or METH2 polypeptide fragments encoded by the METH1 or METH2 polynucleotide
fragments of the present invention may be expressed and purified in the same general
manner as the full length polypeptides, although routine modifications may be necessary due
to the differences in chemical and physical properties between a particular fragment and full
length polypeptide.

As a means of exemplifying but not limiting the present invention, the
polynucleotide encoding the METH1 polypeptide fragment R-235 to L-934 or the METH2
polypeptide fragment R-214 to Q-836 is amplified and cloned as follows: A 5' primer is
generated comprising a restriction enzyme site followed by an initiation codon in frame with
the polynucleotide sequence encoding the N-terminal portion of the polypeptide fragment
beginning with R-235 or R-214, respectively. A complementary 3' primer is generated
comprising a restriction enzyme site followed by a stop codon in frame with the
polynucleotide sequence encoding the C-terminal portion of the METH1 or METH2 polypeptide
fragment ending with L-934 or Q-836, respectively.
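
The primer construction just described can be sketched in Python as follows. This is an
illustrative sketch under stated assumptions: the function names are hypothetical, residue
1 is assumed to be the initiator methionine of the coding sequence passed in, the BamHI
(GGATCC) and XbaI (TCTAGA) sites and the TAA stop codon are example choices, and the
18-nucleotide annealing length is arbitrary. Primers designed this way would still need the
usual checks for melting temperature and for internal restriction sites.

```python
from typing import Tuple

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def fragment_cds_range(first_residue: int, last_residue: int) -> Tuple[int, int]:
    """0-based, end-exclusive nucleotide range in the coding sequence.

    Assumes residue 1 is encoded by nucleotides 0-2 of the coding sequence.
    """
    if first_residue < 1 or last_residue < first_residue:
        raise ValueError("invalid residue range")
    return (first_residue - 1) * 3, last_residue * 3


def deletion_primers(cds: str, first_residue: int, last_residue: int,
                     site5: str = "GGATCC", site3: str = "TCTAGA",
                     anneal: int = 18) -> Tuple[str, str]:
    """Build 5' and 3' primers (both written 5' to 3') for a deletion mutant.

    5' primer: restriction site + ATG + start of the fragment (sense strand).
    3' primer: restriction site + reverse complement of (end of fragment + TAA).
    """
    start, end = fragment_cds_range(first_residue, last_residue)
    if end > len(cds) or end - start < anneal:
        raise ValueError("fragment lies outside the sequence or is too short")
    fragment = cds[start:end].upper()
    forward = site5 + "ATG" + fragment[:anneal]
    reverse = site3 + reverse_complement(fragment[-anneal:] + "TAA")
    return forward, reverse
```

Under the numbering assumption above, `fragment_cds_range(235, 934)` gives nucleotides
702 to 2802 of the coding sequence, a 2100-nucleotide fragment encoding 700 residues.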

The amplified polynucleotide fragment and the expression vector are digested with
restriction enzymes which recognize the sites in the primers. The digested polynucleotides
are then ligated together. The METH1 or METH2 polynucleotide fragment is inserted into
the restricted expression vector, preferably in a manner which places the METH1 or METH2
polypeptide fragment coding region downstream from the promoter. The ligation mixture
is transformed into competent E. coli cells using standard procedures and as described in the
Examples herein. Plasmid DNA is isolated from resistant colonies and the identity of the
cloned DNA confirmed by restriction analysis, PCR and DNA sequencing.



Example 12: Protein Fusions of METH1 or METH2

METH1 or METH2 polypeptides are preferably fused to other proteins. These fusion
proteins can be used for a variety of applications. For example, fusion of METH1 or
METH2 polypeptides to His-tag, HA-tag, protein A, IgG domains, and maltose binding
protein facilitates purification. (See Example 7; see also EP A 394,827; Traunecker, et al.,
Nature 331:84-86 (1988).) Similarly, fusion to IgG-1, IgG-3, and albumin increases the
half-life in vivo. Nuclear localization signals fused to METH1 or METH2 polypeptides
can target the protein to a specific subcellular localization, while covalent heterodimers or
homodimers can increase or decrease the activity of a fusion protein. Fusion proteins can
also create chimeric molecules having more than one function. Finally, fusion proteins can
increase solubility and/or stability of the fused protein compared to the non-fused protein.
All of the types of fusion proteins described above can be made by modifying the following
protocol, which outlines the fusion of a polypeptide to an IgG molecule, or the protocol
described in Example 7.

Briefly, the human Fc portion of the IgG molecule can be PCR amplified, using
primers that span the 5' and 3' ends of the sequence described below. These primers also
should have convenient restriction enzyme sites that will facilitate cloning into an expression
vector, preferably a mammalian expression vector.

For example, if pC4 (Accession No. 209646) is used, the human Fc portion can be
ligated into the BamHI cloning site. Note that the 3' BamHI site should be destroyed. Next,
the vector containing the human Fc portion is re-restricted with BamHI, linearizing the
vector, and METH1 or METH2 polynucleotide, isolated by the PCR protocol described in
Example 5, is ligated into this BamHI site. Note that the polynucleotide is cloned without
a stop codon, otherwise a fusion protein will not be produced.
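
The requirement that the insert carry no stop codon can be checked computationally before
cloning, as in the following Python sketch. The function names are hypothetical, and the
check assumes the supplied sequence begins at the first base of the first codon; it does
not verify the reading frame across the restriction site junction, which depends on the
particular vector and primers used.

```python
from typing import List

STOP_CODONS = {"TAA", "TAG", "TGA"}


def in_frame_stop_positions(cds: str) -> List[int]:
    """Return 0-based nucleotide offsets of stop codons in reading frame 0."""
    cds = cds.upper()
    return [i for i in range(0, len(cds) - 2, 3) if cds[i:i + 3] in STOP_CODONS]


def is_fusion_ready(cds: str) -> bool:
    """True if the insert is a whole number of codons with no in-frame stop."""
    return len(cds) % 3 == 0 and not in_frame_stop_positions(cds)
```

An insert that fails this check would terminate translation before the Fc portion, or
shift the Fc portion out of frame, so no fusion protein would be produced.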

If the naturally occurring signal sequence is used to produce the secreted protein,
pC4 does not need a second signal peptide. Alternatively, if the naturally occurring signal
sequence is not used, the vector can be modified to include a heterologous signal sequence.
(See, e.g., WO 96/34891.)

Human IgG Fc region: 

GGGATCCGGAGGGGAAATGTTCTGACAAAACTGAGAGATGGCGAGCGTGGGG 
AGCACCTGAATTGGAGGGTGCACCGTGAGTCTTCGTCTTGGGGCGAAAAGGG 

AAGGACACGGTGATGATCTCGGGGAGTCGTGAGGTGAGATGGGTGGTGGTGG
ACGTAAGCCAGGAAGAGGGTGAGGTCAAGTTGAAGTGGTAGGTGGAGGGGG 
TGGAGGTGCATAATGCCAAGAGAAAGCCGGGGGAGGAGGAGTAGAAGAGCA 
CGTAGGGTGTGGTCAGCGTGGTCAGGGTGCTGGACGAGGAGTGGGTGAATGG 
GAAGGAGTAGAAGTGCAAGGTGTGCAACAAAGGGGTGCCAAGCCGCATGGA 

GAAAAGCATGTCGAAAGGCAAAGGGGAGCCCGGAGAAGCACAGGTGTAGAG
CCTGGGCCGATGCCGGGATGAGGTGACGAAGAAGGAGGTCAGGGTGAGGTGG 
CTGGTGAAAGGCTTCTATGCAAGGGAGATGGGGGTGGAGTGGGAGAGCAATG 
GGGAGGGGGAGAAGAAGTACAAGACCAGGGCTGCGGTGGTGGAGTGGGAGG 
GCTCCTTGTTCCTGTAGAGCAAGCTGAGCGTGGACAAGAGCAGGTGGCAGCA 

GGGGAAGGTGTTCTCATGGTGGGTGATGCATGAGGGTGTGGAGAAGGAGTAC
ACGCAGAAGAGCCTCTCCCTGTCTCCGGGTAAATGAGTGCGACGGCCGCGAC
TCTAGAGGAT (SEQ ID NO:85)

Example 13: Production of an Antibody 

a) The antibodies of the present invention can be prepared by a variety of
methods. (See, Current Protocols, Chapter 2.) For example, cells expressing METH1 or
METH2 are administered to an animal to induce the production of sera containing polyclonal
antibodies. In a preferred method, a preparation of METH1 or METH2 protein is prepared
and purified to render it substantially free of natural contaminants. Such a preparation is
then introduced into an animal in order to produce polyclonal antisera of greater specific
activity.

In the most preferred method, the antibodies of the present invention are monoclonal
antibodies (or protein binding fragments thereof). Such monoclonal antibodies can be
prepared using hybridoma technology. (Kohler et al., Nature 256:495 (1975); Kohler et al.,
Eur. J. Immunol. 6:511 (1976); Kohler et al., Eur. J. Immunol. 6:292 (1976); Hammerling
et al., in: Monoclonal Antibodies and T-Cell Hybridomas, Elsevier, N.Y., pp. 563-681
(1981).) In general, such procedures involve immunizing an animal (preferably a mouse)
with METH1 or METH2 polypeptide or, more preferably, with a secreted METH1 or
METH2 polypeptide-expressing cell. Such cells may be cultured in any suitable tissue
culture medium; however, it is preferable to culture cells in Earle's modified Eagle's medium
supplemented with 10% fetal bovine serum (inactivated at about 56 °C), and supplemented
with about 10 g/l of nonessential amino acids, about 1,000 U/ml of penicillin, and about 100
µg/ml of streptomycin.

The splenocytes of such mice are extracted and fused with a suitable myeloma cell 
line. Any suitable myeloma cell line may be employed in accordance with the present 
invention; however, it is preferable to employ the parent myeloma cell line (SP20), 
available from the ATCC. After fusion, the resulting hybridoma cells are selectively 
maintained in HAT medium, and then cloned by limiting dilution as described by Wands et 
al. (Gastroenterology 80:225-232 (1981)). The hybridoma cells obtained through such a 
selection are then assayed to identify clones which secrete antibodies capable of binding the 
METH1 or METH2 polypeptide. 

Alternatively, additional antibodies capable of binding to METH1 or METH2 
polypeptide can be produced in a two-step procedure using anti-idiotypic antibodies. Such 
a method makes use of the fact that antibodies are themselves antigens, and therefore, it is 
possible to obtain an antibody which binds to a second antibody. In accordance with this 
method, protein specific antibodies are used to immunize an animal, preferably a mouse. 
The splenocytes of such an animal are then used to produce hybridoma cells, and the 
hybridoma cells are screened to identify clones which produce an antibody whose ability to 
bind to the METH1 or METH2 protein-specific antibody can be blocked by METH1 or 
METH2. Such antibodies comprise anti-idiotypic antibodies to the METH1 or METH2 
protein-specific antibody and can be used to immunize an animal to induce formation of 
further METH1 or METH2 protein-specific antibodies. 

It will be appreciated that Fab and F(ab')2 and other fragments of the antibodies of 
the present invention may be used according to the methods disclosed herein. Such 
fragments are typically produced by proteolytic cleavage, using enzymes such as papain (to 
produce Fab fragments) or pepsin (to produce F(ab')2 fragments). Alternatively, secreted 
METH1 or METH2 protein-binding fragments can be produced through the application of 
recombinant DNA technology or through synthetic chemistry. 

For in vivo use of antibodies in humans, it may be preferable to use "humanized" 
chimeric monoclonal antibodies. Such antibodies can be produced using genetic constructs 
derived from hybridoma cells producing the monoclonal antibodies described above. 
Methods for producing chimeric antibodies are known in the art. (See, for review, Morrison, 
Science 229:1202 (1985); Oi et al., BioTechniques 4:214 (1986); Cabilly et al., U.S. Patent 
No. 4,816,567; Taniguchi et al., EP 171496; Morrison et al., EP 173494; Neuberger et al., 
WO 8601533; Robinson et al., WO 8702671; Boulianne et al., Nature 312:643 (1984); 
Neuberger et al., Nature 314:268 (1985).) 

b) Isolation of antibody fragments directed against METH1 and/or METH2 
from a library of scFvs. 

Naturally occurring V-genes isolated from human PBLs are constructed into a large 
library of antibody fragments which contain reactivities against METH1 and/or METH2 to 
which the donor may or may not have been exposed (see e.g., U.S. Patent 5,885,793 
incorporated herein in its entirety by reference). 

Rescue of the Library. A library of scFvs is constructed from the RNA of human 
PBLs as described in WO92/01047. To rescue phage displaying antibody fragments, 
approximately 10^ E. coli harbouring the phagemid are used to inoculate 50 ml of 2xTY 
containing 1% glucose and 100 ug/ml of ampicillin (2xTY-AMP-GLU) and grown to an 
O.D. of 0.8 with shaking. Five ml of this culture is used to inoculate 50 ml of 2xTY-AMP-GLU, 
2 x 10^ TU of delta gene 3 helper (M13 delta gene III, see WO92/01047) are added 
and the culture incubated at 37°C for 45 minutes without shaking and then at 37°C for 45 
minutes with shaking. The culture is centrifuged at 4000 r.p.m. for 10 min. and the pellet 
resuspended in 2 liters of 2xTY containing 100 ug/ml ampicillin and 50 ug/ml kanamycin 
and grown overnight. Phage are prepared as described in WO92/01047. 

M13 delta gene III is prepared as follows: M13 delta gene III helper phage does not 
encode gene III protein, hence the phage(mid) displaying antibody fragments have a greater 
avidity of binding to antigen. Infectious M13 delta gene III particles are made by growing 
the helper phage in cells harbouring a pUC19 derivative supplying the wild type gene III 
protein during phage morphogenesis. The culture is incubated for 1 hour at 37 °C without 
shaking and then for a further hour at 37 °C with shaking. Cells are spun down (IEC-Centra 
8, 4000 revs/min for 10 min), resuspended in 300 ml 2xTY broth containing 100 ug 
ampicillin/ml and 25 ug kanamycin/ml (2xTY-AMP-KAN) and grown overnight, shaking 
at 37 °C. Phage particles are purified and concentrated from the culture medium by two 
PEG-precipitations (Sambrook et al., 1990), resuspended in 2 ml PBS and passed through 
a 0.45 um filter (Minisart NML; Sartorius) to give a final concentration of approximately 
10^^ transducing units/ml (ampicillin-resistant clones). 

Panning of the Library. Immunotubes (Nunc) are coated overnight in PBS with 
4 ml of either 100 ug/ml or 10 ug/ml of a polypeptide of the present invention. Tubes are 
blocked with 2% Marvel-PBS for 2 hours at 37 °C and then washed 3 times in PBS. 
Approximately 10^^ TU of phage is applied to the tube and incubated for 30 minutes at room 
temperature tumbling on an over and under turntable and then left to stand for another 1.5 
hours. Tubes are washed 10 times with PBS 0.1% Tween-20 and 10 times with PBS. Phage 
are eluted by adding 1 ml of 100 mM triethylamine and rotating 15 minutes on an under and 
over turntable after which the solution is immediately neutralized with 0.5 ml of 1.0 M 
Tris-HCl, pH 7.4. Phage are then used to infect 10 ml of mid-log E. coli TG1 by incubating 
eluted phage with bacteria for 30 minutes at 37 °C. The E. coli are then plated on TYE plates 
containing 1% glucose and 100 ug/ml ampicillin. The resulting bacterial library is then 
rescued with delta gene 3 helper phage as described above to prepare phage for a subsequent 
round of selection. This process is then repeated for a total of 4 rounds of affinity 
purification with tube-washing increased to 20 times with PBS, 0.1% Tween-20 and 20 
times with PBS for rounds 3 and 4. 

Characterization of Binders. Eluted phage from the 3rd and 4th rounds of 
selection are used to infect E. coli HB 2151 and soluble scFv is produced (Marks, et al., 
1991) from single colonies for assay. ELISAs are performed with microtitre plates coated 
with 10 pg/ml of the polypeptide of the present invention in 50 mM bicarbonate pH 
9.6. Clones positive in ELISA are further characterized by PCR fingerprinting (see e.g., 
WO92/01047) and then by sequencing. 

Example 14: Production Of METH1 or METH2 Protein For High-Throughput Screening Assays 

The following protocol produces a supernatant containing METH1 or METH2 
polypeptide to be tested. This supernatant can then be used in the Screening Assays 
described in Examples 16-23. 

First, dilute Poly-D-Lysine (644 587 Boehringer-Mannheim) stock solution (1 mg/ml 
in PBS) 1:20 in PBS (w/o calcium or magnesium 17-516F Biowhittaker) for a working 
solution of 50 ug/ml. Add 200 ul of this solution to each well (24 well plates) and incubate 
at RT for 20 minutes. Be sure to distribute the solution over each well (note: a 12-channel 
pipetter may be used with tips on every other channel). Aspirate off the Poly-D-Lysine 
solution and rinse with 1 ml PBS (Phosphate Buffered Saline). The PBS should remain in 
the well until just prior to plating the cells, and plates may be poly-lysine coated in advance 
for up to two weeks. 
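
The dilution arithmetic in the coating step can be sketched as follows. This is only an illustration: the function names and the 10% pipetting overage are assumptions introduced here, not part of the protocol.

```python
def dilute(stock_conc, dilution_factor):
    """Concentration after a 1:dilution_factor dilution (same units as the stock)."""
    return stock_conc / dilution_factor


def working_volume_ml(wells, ul_per_well, overage=0.10):
    """Working solution needed for one plate, in ml.

    The 10% overage is an assumed allowance for pipetting loss.
    """
    return wells * ul_per_well * (1 + overage) / 1000


stock_ug_per_ml = 1000.0                      # 1 mg/ml Poly-D-Lysine stock
working_ug_per_ml = dilute(stock_ug_per_ml, 20)   # 1:20 in PBS
total_ml = working_volume_ml(wells=24, ul_per_well=200)
stock_ml = total_ml / 20                      # one part stock in twenty parts total

print(f"working concentration: {working_ug_per_ml:.0f} ug/ml")
print(f"per 24-well plate: {total_ml:.2f} ml "
      f"({stock_ml:.3f} ml stock + {total_ml - stock_ml:.3f} ml PBS)")
```

With the values stated in the protocol, the 1:20 dilution of the 1 mg/ml stock gives the 50 ug/ml working solution named in the text.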

Plate 293T cells (do not carry cells past P+20) at 2 x 10^ cells/well in .5 ml 
DMEM (Dulbecco's Modified Eagle Medium) (with 4.5 G/L glucose and L-glutamine 
(12-604F Biowhittaker))/10% heat inactivated FBS (14-503F Biowhittaker)/1x Penstrep 
(17-602E Biowhittaker). Let the cells grow overnight. 

The next day, mix together in a sterile solution basin: 300 ul Lipofectamine 
(18324-012 Gibco/BRL) and 5 ml Optimem I (31985070 Gibco/BRL)/96-well plate. With a small 
volume multi-channel pipetter, aliquot approximately 2 ug of an expression vector containing 
a polynucleotide insert, produced by the methods described in Examples 10-12, into an 
appropriately labeled 96-well round bottom plate. With a multi-channel pipetter, add 50 ul 
of the Lipofectamine/Optimem I mixture to each well. Pipette up and down gently to mix. 
Incubate at RT 15-45 minutes. After about 20 minutes, use a multi-channel pipetter to add 
150 ul Optimem I to each well. As a control, one plate of vector DNA lacking an insert 
should be transfected with each set of transfections. 
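
The volumes in the mixing step above can be checked with a short calculation. This is a sketch under the stated quantities only; the variable names are illustrative, and the small volume of the DNA aliquot itself is neglected because the protocol does not state it.

```python
LIPOFECTAMINE_UL = 300.0    # Lipofectamine per 96-well plate of DNA
OPTIMEM_UL = 5000.0         # 5 ml Optimem I in the master mix
MIX_PER_WELL_UL = 50.0      # master mix added to each DNA well
OPTIMEM_TOPUP_UL = 150.0    # Optimem I added after about 20 minutes
WELLS = 96

mix_total = LIPOFECTAMINE_UL + OPTIMEM_UL          # master mix prepared
mix_used = WELLS * MIX_PER_WELL_UL                 # master mix dispensed
mix_spare = mix_total - mix_used                   # dead volume left in the basin
lipo_per_well = MIX_PER_WELL_UL * LIPOFECTAMINE_UL / mix_total
complex_per_well = MIX_PER_WELL_UL + OPTIMEM_TOPUP_UL  # DNA volume neglected

print(f"master mix: {mix_total:.0f} ul, used {mix_used:.0f} ul, spare {mix_spare:.0f} ul")
print(f"Lipofectamine per well: {lipo_per_well:.2f} ul")
print(f"complex per well: {complex_per_well:.0f} ul")
```

The 200 ul per-well total agrees with the 200 ul of DNA/Lipofectamine/Optimem I complex that is later transferred to each well of the 24-well plates.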

Preferably, the transfection should be performed by tag-teaming the following tasks. 
By tag-teaming, hands on time is cut in half, and the cells do not spend too much time on 
PBS. First, person A aspirates off the media from four 24-well plates of cells, and then 
person B rinses each well with .5-1 ml PBS. Person A then aspirates off PBS rinse, and 
person B, using a 12-channel pipetter with tips on every other channel, adds the 200 ul of 
DNA/Lipofectamine/Optimem I complex to the odd wells first, then to the even wells, to 
each row on the 24-well plates. Incubate at 37 °C for 6 hours. 

While cells are incubating, prepare appropriate media, either 1% BSA in DMEM with 
1x penstrep, or HGS CHO-5 media (116.6 mg/L of CaCl2 (anhyd); 0.00130 mg/L CuSO4-5H2O; 
0.050 mg/L of Fe(NO3)3-9H2O; 0.417 mg/L of FeSO4-7H2O; 311.80 mg/L of KCl; 
28.64 mg/L of MgCl2; 48.84 mg/L of MgSO4; 6995.50 mg/L of NaCl; 2400.0 mg/L of 
NaHCO3; 62.50 mg/L of NaH2PO4-H2O; 71.02 mg/L of Na2HPO4; .4320 mg/L of ZnSO4-7H2O; 
.002 mg/L of Arachidonic Acid; 1.022 mg/L of Cholesterol; .070 mg/L of 
DL-alpha-Tocopherol-Acetate; 0.0520 mg/L of Linoleic Acid; 0.010 mg/L of Linolenic Acid; 0.010 
mg/L of Myristic Acid; 0.010 mg/L of Oleic Acid; 0.010 mg/L of Palmitric Acid; 0.010 
mg/L of Palmitic Acid; 100 mg/L of Pluronic F-68; 0.010 mg/L of Stearic Acid; 2.20 mg/L 
of Tween 80; 4551 mg/L of D-Glucose; 130.85 mg/ml of L-Alanine; 147.50 mg/ml of 
L-Arginine-HCL; 7.50 mg/ml of L-Asparagine-H2O; 6.65 mg/ml of L-Aspartic Acid; 29.56 
mg/ml of L-Cystine-2HCL-H2O; 31.29 mg/ml of L-Cystine-2HCL; 7.35 mg/ml of 
L-Glutamic Acid; 365.0 mg/ml of L-Glutamine; 18.75 mg/ml of Glycine; 52.48 mg/ml of 
L-Histidine-HCL-H2O; 106.97 mg/ml of L-Isoleucine; 111.45 mg/ml of L-Leucine; 163.75 
mg/ml of L-Lysine HCL; 32.34 mg/ml of L-Methionine; 68.48 mg/ml of L-Phenylalanine; 
40.0 mg/ml of L-Proline; 26.25 mg/ml of L-Serine; 101.05 mg/ml of L-Threonine; 19.22 
mg/ml of L-Tryptophan; 91.79 mg/ml of L-Tyrosine-2Na-2H2O; and 99.65 mg/ml of 
L-Valine; 0.0035 mg/L of Biotin; 3.24 mg/L of D-Ca Pantothenate; 11.78 mg/L of Choline 
Chloride; 4.65 mg/L of Folic Acid; 15.60 mg/L of i-Inositol; 3.02 mg/L of Niacinamide; 
3.00 mg/L of Pyridoxal HCL; 0.031 mg/L of Pyridoxine HCL; 0.319 mg/L of Riboflavin; 
3.17 mg/L of Thiamine HCL; 0.365 mg/L of Thymidine; 0.680 mg/L of Vitamin B12; 25 mM 
of HEPES Buffer; 2.39 mg/L of Na Hypoxanthine; 0.105 mg/L of Lipoic Acid; 0.081 mg/L 
of Sodium Putrescine-2HCL; 55.0 mg/L of Sodium Pyruvate; 0.0067 mg/L of Sodium 
Selenite; 20 uM of Ethanolamine; 0.122 mg/L of Ferric Citrate; 41.70 mg/L of Methyl-B-Cyclodextrin 
complexed with Linoleic Acid; 33.33 mg/L of Methyl-B-Cyclodextrin 
complexed with Oleic Acid; 10 mg/L of Methyl-B-Cyclodextrin complexed with Retinal 
Acetate. Adjust osmolarity to 327 mOsm) with 2 mM glutamine and 1x penstrep. (BSA 
(81-068-3 Bayer) 100 gm dissolved in 1 L DMEM for a 10% BSA stock solution). Filter the 
media and collect 50 ul for endotoxin assay in 15 ml polystyrene conical. 

The transfection reaction is terminated, preferably by tag-teaming, at the end of the 
incubation period. Person A aspirates off the transfection media, while person B adds 1.5 ml 
appropriate media to each well. Incubate at 37 °C for 45 or 72 hours depending on the 
media used: 1% BSA for 45 hours or CHO-5 for 72 hours. 

On day four, using a 300 ul multichannel pipetter, aliquot 600 ul in one 1 ml deep well 
plate and the remaining supernatant into a 2 ml deep well. The supernatants from each well 
can then be used in the assays described in Examples 16-23. 

It is specifically understood that when activity is obtained in any of the assays 
described below using a supernatant, the activity originates from either the METH1 or 
METH2 polypeptide directly (e.g., as a secreted protein) or by METH1 or METH2 inducing 
expression of other proteins, which are then secreted into the supernatant. Thus, the 
invention further provides a method of identifying the protein in the supernatant 
characterized by an activity in a particular assay. 

Example 15: Construction of GAS Reporter Construct 

One signal transduction pathway involved in the differentiation and proliferation of 
cells is called the Jaks-STATs pathway. Activated proteins in the Jaks-STATs pathway bind 
to gamma activation site "GAS" elements or interferon-sensitive responsive element 
("ISRE"), located in the promoter of many genes. The binding of a protein to these elements 
alters the expression of the associated gene. 

GAS and ISRE elements are recognized by a class of transcription factors called 
Signal Transducers and Activators of Transcription, or "STATs." There are six members 
of the STATs family. Stat1 and Stat3 are present in many cell types, as is Stat2 (as response 
to IFN-alpha is widespread). Stat4 is more restricted and is not in many cell types though 
it has been found in T helper class I cells after treatment with IL-12. Stat5 was originally 
called mammary growth factor, but has been found at higher concentrations in other cells 
including myeloid cells. It can be activated in tissue culture cells by many cytokines. 

The STATs are activated to translocate from the cytoplasm to the nucleus upon 
tyrosine phosphorylation by a set of kinases known as the Janus Kinase ("Jaks") family. 
Jaks represent a distinct family of soluble tyrosine kinases and include Tyk2, Jak1, Jak2, and 
Jak3. These kinases display significant sequence similarity and are generally catalytically 
inactive in resting cells. 

The Jaks are activated by a wide range of receptors summarized in the Table below. 
(Adapted from review by Schindler and Darnell, Ann. Rev. Biochem. 64:621-51 (1995).) A 
cytokine receptor family, capable of activating Jaks, is divided into two groups: (a) Class 1 
includes receptors for IL-2, IL-3, IL-4, IL-6, IL-7, IL-9, IL-11, IL-12, IL-15, Epo, PRL, GH, 
G-CSF, GM-CSF, LIF, CNTF, and thrombopoietin; and (b) Class 2 includes IFN-a, IFN-g, 
and IL-10. The Class 1 receptors share a conserved cysteine motif (a set of four conserved 
cysteines and one tryptophan) and a WSXWS motif (a membrane proximal region encoding 
Trp-Ser-Xxx-Trp-Ser (SEQ ID NO:82)). 

Thus, on binding of a ligand to a receptor, Jaks are activated, which in turn activate 
STATs, which then translocate and bind to GAS elements. This entire process is 
encompassed in the Jaks-STATs signal transduction pathway. 

Therefore, activation of the Jaks-STATs pathway, reflected by the binding of the 
GAS or the ISRE element, can be used to indicate proteins involved in the proliferation and 
differentiation of cells. For example, growth factors and cytokines are known to activate the 
Jaks-STATs pathway. (See Table below.) Thus, by using GAS elements linked to reporter 
molecules, activators of the Jaks-STATs pathway can be identified. 

Ligand                  tyk2   Jak1   Jak2   Jak3   STATs   GAS (elements) or ISRE

IFN family
IFN-a/B                 +      +                    1,2,3   ISRE
IFN-g                          +      +             1       GAS (IRF1>Lys6>IFP)
IL-10                   +      ?      ?             1,3

gp130 family
IL-6 (Pleiotropic)      +      +             ?      1,3     GAS (IRF1>Lys6>IFP)
IL-11 (Pleiotropic)     ?             ?      ?      1,3
OnM (Pleiotropic)       ?      +      +      ?      1,3
LIF (Pleiotropic)       ?      +      +      ?      1,3
CNTF (Pleiotropic)      -/+    +      +      ?      1,3
G-CSF (Pleiotropic)     ?             ?      ?      1,3
IL-12 (Pleiotropic)     +             +      +      1,3

g-C family
IL-2 (lymphocytes)             +             +      1,3,5   GAS
IL-4 (lymph/myeloid)           +             +      6       GAS (IRF1 = IFP »Ly6)(IgH)
IL-7 (lymphocytes)             +             +      5       GAS
IL-9 (lymphocytes)             +             +      5       GAS
IL-13 (lymphocyte)                    ?      ?      6       GAS
IL-15                   ?      +      ?      +      5       GAS

gp140 family
IL-3 (myeloid)                        +             5       GAS (IRF1>IFP»Ly6)
IL-5 (myeloid)                        +             5       GAS
GM-CSF (myeloid)                                    5       GAS

Growth hormone family
GH                      ?             +             5
PRL                     ?      +/-    +             1,3,5   GAS (B-CAS>IRF1-IFP»Ly6)
EPO                     ?             +             5

Receptor Tyrosine Kinases
EGF                     ?             +             1,3     GAS (IRF1)
PDGF                    ?      +      +             1,3     GAS (not IRF1)
CSF-1                   ?      +      +             1,3

To construct a synthetic GAS containing promoter element, which is used in the 
Biological Assays described in Examples 16-17, a PCR based strategy is employed to 
generate a GAS-SV40 promoter sequence. The 5' primer contains four tandem copies of the 
GAS binding site found in the IRF1 promoter and previously demonstrated to bind STATs 
upon induction with a range of cytokines (Rothman et al., Immunity 1:457-468 (1994)), 
although other GAS or ISRE elements can be used instead. The 5' primer also contains 18 bp 
of sequence complementary to the SV40 early promoter sequence and is flanked with an 
XhoI site. The sequence of the 5' primer is: 

5':GCGCCTCGAGATTTCCCCGAAATCTAGATTTCCCCGAAATGATTTCCCCGA 
AATGATTTCCCCGAAATATCTGCCATCTCAATTAG:3' (SEQ ID NO:86) 

The downstream primer is complementary to the SV40 promoter and is flanked with 
a Hind III site: 5':GCGGCAAGCTTTTTGCAAAGCCTAGGC:3' (SEQ ID NO:87) 

PCR amplification is performed using the SV40 promoter template present in the 
B-gal:promoter plasmid obtained from Clontech. The resulting PCR fragment is digested with 
XhoI/Hind III and subcloned into BLSK2- (Stratagene). Sequencing with forward and 
reverse primers confirms that the insert contains the following sequence: 

5': CTCGAG ATTTCCCCGAAATCTAGATTTCCCCGAAATGATTTCCCCGAAATG 
ATTTCCCCGAAATATCTGCCATCTCAATTAGTCAGCAACCATAGTCCCGCCCC 
TAACTCCGCCCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCC 
TGAGCTATTCCAGAAGTAGTGAGGAGGCTTTTTTGGAGGCCTAGGCTTTTGC 
AAA AAGCTT :3' (SEQ ID NO:88) 
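
The primer design described above can be sanity-checked with simple string searches. This is a minimal sketch, not part of the disclosed method: the primer sequences are copied from SEQ ID NO:86 and SEQ ID NO:87 as printed, while the 9-nt `GAS_CORE` string is an assumption chosen here as the segment shared by all four repeats.

```python
# 5' primer (SEQ ID NO:86) and downstream primer (SEQ ID NO:87), as printed above.
UPSTREAM = (
    "GCGCCTCGAGATTTCCCCGAAATCTAGATTTCCCCGAAATGATTTCCCCGA"
    "AATGATTTCCCCGAAATATCTGCCATCTCAATTAG"
)
DOWNSTREAM = "GCGGCAAGCTTTTTGCAAAGCCTAGGC"

GAS_CORE = "TTCCCCGAA"      # assumed core of the repeated GAS site
XHOI = "CTCGAG"             # XhoI recognition sequence
HINDIII = "AAGCTT"          # HindIII recognition sequence

gas_copies = UPSTREAM.count(GAS_CORE)     # number of tandem GAS repeats
xhoi_pos = UPSTREAM.find(XHOI)            # 0-based offset of the XhoI site
hindiii_pos = DOWNSTREAM.find(HINDIII)    # 0-based offset of the HindIII site
sv40_tail = UPSTREAM[-18:]                # 18 nt annealing to the SV40 promoter

print(f"GAS repeats: {gas_copies}")
print(f"XhoI site at offset {xhoi_pos}, HindIII site at offset {hindiii_pos}")
print(f"SV40-annealing tail: {sv40_tail}")
```

The short run of bases ahead of each restriction site is consistent with the sites being placed near, rather than at, the 5' ends of the primers.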

With this GAS promoter element linked to the SV40 promoter, a GAS:SEAP2 
reporter construct is next engineered. Here, the reporter molecule is a secreted alkaline 
phosphatase, or "SEAP." Clearly, however, any reporter molecule can be used instead of 
SEAP, in this or in any of the other Examples. Well known reporter molecules that can be 
used instead of SEAP include chloramphenicol acetyltransferase (CAT), luciferase, alkaline 
phosphatase, B-galactosidase, green fluorescent protein (GFP), or any protein detectable by 
an antibody. 

The above sequence confirmed synthetic GAS-SV40 promoter element is subcloned 
into the pSEAP-Promoter vector obtained from Clontech using HindIII and XhoI, effectively 
replacing the SV40 promoter with the amplified GAS:SV40 promoter element, to create the 
GAS-SEAP vector. However, this vector does not contain a neomycin resistance gene, and 
therefore, is not preferred for mammalian expression systems. 

Thus, in order to generate mammalian stable cell lines expressing the GAS-SEAP 
reporter, the GAS-SEAP cassette is removed from the GAS-SEAP vector using SalI and 
NotI, and inserted into a backbone vector containing the neomycin resistance gene, such as 
pGFP-1 (Clontech), using these restriction sites in the multiple cloning site, to create the 
GAS-SEAP/Neo vector. Once this vector is transfected into mammalian cells, this vector 
can then be used as a reporter molecule for GAS binding as described in Examples 16-17. 

Other constructs can be made using the above description and replacing GAS with 
a different promoter sequence. For example, construction of reporter molecules containing 
NF-KB and EGR promoter sequences is described in Examples 18 and 19. However, many 
other promoters can be substituted using the protocols described in these Examples. For 
instance, SRE, IL-2, NFAT, or Osteocalcin promoters can be substituted, alone or in 
combination (e.g., GAS/NF-KB/EGR, GAS/NF-KB, IL-2/NFAT, or NF-KB/GAS). 
Similarly, other cell lines can be used to test reporter construct activity, such as HELA 
(epithelial), HUVEC (endothelial), Reh (B-cell), Saos-2 (osteoblast), HUVAC (aortic), or 
Cardiomyocyte. 

Example 16: High-Throughput Screening Assay for T-cell Activity 

The following protocol is used to assess T-cell activity of METH1 or METH2 by 
determining whether METH1 or METH2 supernatant proliferates and/or differentiates 
T-cells. T-cell activity is assessed using the GAS/SEAP/Neo construct produced in Example 
15. Thus, factors that increase SEAP activity indicate the ability to activate the Jaks-STATS 
signal transduction pathway. The T-cells used in this assay are Jurkat T-cells (ATCC 
Accession No. TIB-152), although Molt-3 cells (ATCC Accession No. CRL-1552) and 
Molt-4 cells (ATCC Accession No. CRL-1582) can also be used. 

Jurkat T-cells are lymphoblastic CD4+ Th1 helper cells. In order to generate stable 
cell lines, approximately 2 million Jurkat cells are transfected with the GAS-SEAP/neo 
vector using DMRIE-C (Life Technologies) (transfection procedure described below). The 
transfected cells are seeded to a density of approximately 20,000 cells per well and 
transfectants resistant to 1 mg/ml Geneticin are selected. Resistant colonies are expanded and 
then tested for their response to increasing concentrations of interferon gamma. The dose 
response of a selected clone is demonstrated. 

Specifically, the following protocol will yield sufficient cells for 75 wells containing 
200 ul of cells. Thus, it is either scaled up, or performed in multiple to generate sufficient 
cells for multiple 96 well plates. Jurkat cells are maintained in RPMI + 10% serum with 
1% Pen-Strep. Combine 2.5 mls of OPTI-MEM (Life Technologies) with 10 ug of plasmid 
DNA in a T25 flask. Add 2.5 ml OPTI-MEM containing 50 ul of DMRIE-C and incubate 
at room temperature for 15-45 mins. 

During the incubation period, count cell concentration, spin down the required 
number of cells (10^ per transfection), and resuspend in OPTI-MEM to a final concentration 
of 10^ cells/ml. Then add 1 ml of 1 x 10^ cells in OPTI-MEM to T25 flask and incubate at 
37°C for 6 hrs. After the incubation, add 10 ml of RPMI + 15% serum. 

The Jurkat:GAS-SEAP stable reporter lines are maintained in RPMI + 10% serum, 
1 mg/ml Geneticin, and 1% Pen-Strep. These cells are treated with supernatants containing 
METH1 or METH2 polypeptides or METH1 or METH2 induced polypeptides as produced 
by the protocol described in Example 14. 

On the day of treatment with the supernatant, the cells should be washed and 
resuspended in fresh RPMI + 10% serum to a density of 500,000 cells per ml. The exact 
number of cells required will depend on the number of supernatants being screened. For one 
96 well plate, approximately 10 million cells (for 10 plates, 100 million cells) are required. 

Transfer the cells to a triangular reservoir boat, in order to dispense the cells into a 
96 well dish, using a 12 channel pipette. Using a 12 channel pipette, transfer 200 ul of cells 
into each well (therefore adding 100,000 cells per well). 

After all the plates have been seeded, 50 ul of the supernatants are transferred 
directly from the 96 well plate containing the supernatants into each well using a 12 channel 
pipette. In addition, a dose of exogenous interferon gamma (0.1, 1.0, 10 ng) is added to 
wells H9, H10, and H11 to serve as additional positive controls for the assay. 
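
The seeding figures in the steps above follow from the stated density and volumes, as the short calculation below illustrates. The variable names are illustrative only.

```python
DENSITY_PER_ML = 500_000    # cells per ml after resuspension
CELLS_UL = 200              # cell suspension dispensed per well
SUPERNATANT_UL = 50         # supernatant added per well
WELLS = 96

cells_per_well = DENSITY_PER_ML * CELLS_UL // 1000
cells_per_plate = cells_per_well * WELLS
suspension_ml_per_plate = CELLS_UL * WELLS / 1000
final_volume_ul = CELLS_UL + SUPERNATANT_UL
supernatant_fraction = SUPERNATANT_UL / final_volume_ul

print(f"cells per well: {cells_per_well:,}")
print(f"cells per plate: {cells_per_plate:,} in {suspension_ml_per_plate} ml")
print(f"supernatant is {supernatant_fraction:.0%} of the {final_volume_ul} ul well volume")
```

The per-plate total of 9.6 million cells is what the text rounds to approximately 10 million cells per 96 well plate.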

The 96 well dishes containing Jurkat cells treated with supernatants are placed in an 
incubator for 48 hrs (note: this time is variable between 48-72 hrs). 35 ul samples from each 
well are then transferred to an opaque 96 well plate using a 12 channel pipette. The opaque 
plates should be covered (using cellophane covers) and stored at -20°C until SEAP assays 
are performed according to Example 20. The plates containing the remaining treated cells 
are placed at 4°C and serve as a source of material for repeating the assay on a specific well 
if desired. 

As a positive control, 100 Unit/ml interferon gamma, which is known to activate 
Jurkat T cells, can be used. Over 30 fold induction is typically observed in the positive 
control wells. 

Example 17: High-Throughput Screening Assay Identifying Myeloid Activity 

The following protocol is used to assess myeloid activity of METH1 or METH2 by 
determining whether METH1 or METH2 proliferates and/or differentiates myeloid cells. 
Myeloid cell activity is assessed using the GAS/SEAP/Neo construct produced in Example 
15. Thus, factors that increase SEAP activity indicate the ability to activate the Jaks-STATS 
signal transduction pathway. The myeloid cell used in this assay is U937, a pre-monocyte 
cell line, although TF-1, HL60, or KG1 can be used. 

To transiently transfect U937 cells with the GAS/SEAP/Neo construct produced in 
Example 15, a DEAE-Dextran method (Kharbanda et al., 1994, Cell Growth & 
Differentiation 5:259-265) is used. First, harvest 2x10e' U937 cells and wash with PBS. 
The U937 cells are usually grown in RPMI 1640 medium containing 10% heat-inactivated 
fetal bovine serum (FBS) supplemented with 100 units/ml penicillin and 100 mg/ml 
streptomycin. 

Next, suspend the cells in 1 ml of 20 mM Tris-HCl (pH 7.4) buffer containing 0.5 
mg/ml DEAE-Dextran, 8 ug GAS-SEAP2 plasmid DNA, 140 mM NaCl, 5 mM KCl, 375 
uM Na2HPO4-7H2O, 1 mM MgCl2, and 675 uM CaCl2. Incubate at 37 °C for 45 min. 

Wash the cells with RPMI 1640 medium containing 10% FBS and then resuspend 
in 10 ml complete medium and incubate at 37 °C for 36 hr. 

The GAS-SEAP/U937 stable cells are obtained by growing the cells in 400 ug/ml 
G418. The G418-free medium is used for routine growth but every one to two months, the 
cells should be re-grown in 400 ug/ml G418 for a couple of passages. 

These cells are tested by harvesting 1x10^ cells (this is enough for ten 96-well plate 
assays) and washing with PBS. Suspend the cells in 200 ml of the above described growth 
medium, with a final density of 5x10^ cells/ml. Plate 200 ul cells per well in the 96-well 
plate (or 1x10^ cells/well). 

Add 50 ul of the supernatant prepared by the protocol described in Example 14. 
Incubate at 37°C for 48 to 72 hr. As a positive control, 100 Unit/ml interferon gamma, 
which is known to activate U937 cells, can be used. Over 30 fold induction is typically 
observed in the positive control wells. SEAP assay the supernatant according to the protocol 
described in Example 20. 

Example 18: High-Throughput Screening Assay Identifying Neuronal Activity 

When cells undergo differentiation and proliferation, a group of genes are activated 
through many different signal transduction pathways. One of these genes, EGR1 (early 
growth response gene 1), is induced in various tissues and cell types upon activation. The 
promoter of EGR1 is responsible for such induction. Using the EGR1 promoter linked to 
reporter molecules, activation of cells by METH1 or METH2 can be assessed. 

Particularly, the following protocol is used to assess neuronal activity in PC12 cell 
lines. PC12 cells (rat pheochromocytoma cells) are known to proliferate and/or 
differentiate by activation with a number of mitogens, such as TPA (tetradecanoyl phorbol 
acetate), NGF (nerve growth factor), and EGF (epidermal growth factor). The EGR1 gene 
expression is activated during this treatment. Thus, by stably transfecting PC12 cells with 
a construct containing an EGR promoter linked to SEAP reporter, activation of PC12 cells 
by METH1 or METH2 can be assessed. 

The EGR/SEAP reporter construct can be assembled by the following protocol. The 
EGR-1 promoter sequence (-633 to +1) (Sakamoto K et al., Oncogene 6:867-871 (1991)) can 
be PCR amplified from human genomic DNA using the following primers: 

5' GCGCTCGAGGGATGACAGCGATAGAACCCCGG -3' (SEQ ID NO:89) 
5' GCGAAGCTTCGCGACTCCCCGGATCCGCCTC-3' (SEQ ID NO:90) 

Using the GAS:SEAP/Neo vector produced in Example 15, EGR1 amplified product 
can then be inserted into this vector. Linearize the GAS:SEAP/Neo vector using restriction 
enzymes XhoI/HindIII, removing the GAS/SV40 stuffer. Restrict the EGR1 amplified 
product with these same enzymes. Ligate the vector and the EGR1 promoter. 

To prepare 96 well-plates for cell culture, two mls of a coating solution (1:30 
dilution of collagen type I (Upstate Biotech Inc. Cat#08-115) in 30% ethanol (filter 
sterilized)) is added per one 10 cm plate or 50 ml per well of the 96-well plate, and allowed 
to air dry for 2 hr. 

PC12 cells are routinely grown in RPMI-1640 medium (Bio Whittaker) containing 
10% horse serum (JRH BIOSCIENCES, Cat. # 12449-78P), 5% heat-inactivated fetal 
bovine serum (FBS) supplemented with 100 units/ml penicillin and 100 ug/ml streptomycin 
on a precoated 10 cm tissue culture dish. One to four split is done every three to four days. 
Cells are removed from the plates by scraping and resuspended by pipetting up and down 
more than 15 times. 

Transfect the EGR/SEAP/Neo construct into PC12 using the Lipofectamine protocol 
described in Example 14. EGR-SEAP/PC12 stable cells are obtained by growing the cells 
in 300 ug/ml G418. The G418-free medium is used for routine growth but every one to two 
months, the cells should be re-grown in 300 ug/ml G418 for a couple of passages. 

To assay for neuronal activity, a 10 cm plate with cells around 70 to 80% confluent 
is screened by removing the old medium. Wash the cells once with PBS (Phosphate 
buffered saline). Then starve the cells in low serum medium (RPMI-1640 containing 1% 
horse serum and 0.5% FBS with antibiotics) overnight. 

The next morning, remove the medium and wash the cells with PBS. Scrape off the 
cells from the plate, suspend the cells well in 2 ml low serum medium. Count the cell 
number and add more low serum medium to reach final cell density as 5x10^ cells/ml. 

Add 200 ul of the cell suspension to each well of 96-well plate (equivalent to 1x10^ 
cells/well). Add 50 ul supernatant produced by Example 14 and incubate at 37°C for 48 to 
72 hr. As a positive control, a growth factor known to activate PC12 cells through EGR can 
be used, such as 50 ng/ul of Neuronal Growth Factor (NGF). Over fifty-fold induction of 
SEAP is typically seen in the positive control wells. SEAP assay the supernatant according to 
Example 20. 
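
Fold induction of the kind quoted for the positive control wells can be computed as a background-subtracted ratio. The sketch below is one common way to do this and is an assumption, not the calculation specified by the protocol; the function name and the readings are hypothetical.

```python
def fold_induction(treated, untreated, blank=0.0):
    """Ratio of background-subtracted reporter signal, treated vs. untreated."""
    baseline = untreated - blank
    if baseline <= 0:
        raise ValueError("untreated signal must exceed the blank")
    return (treated - blank) / baseline


# Hypothetical plate-reader values in arbitrary units, not measured data.
blank, untreated, positive_control = 50.0, 250.0, 12_250.0

print(fold_induction(positive_control, untreated, blank))
```

A well would be scored against whatever threshold the screen adopts; the fifty-fold figure in the text describes the positive control, not a cutoff for test supernatants.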

Example 19: High-Throughput Screening Assay for T-cell Activity

NF-KB (Nuclear Factor KB) is a transcription factor activated by a wide variety of
agents including the inflammatory cytokines IL-1 and TNF, CD30 and CD40,
lymphotoxin-alpha and lymphotoxin-beta, by exposure to LPS or thrombin, and by expression
of certain viral gene products. As a transcription factor, NF-KB regulates the expression of
genes involved in immune cell activation, control of apoptosis (NF-KB appears to shield cells
from apoptosis), B and T-cell development, anti-viral and antimicrobial responses, and
multiple stress responses. In non-stimulated conditions, NF-KB is retained in the
cytoplasm with I-KB (Inhibitor KB). However, upon stimulation, I-KB is phosphorylated
and degraded, causing NF-KB to shuttle to the nucleus, thereby activating transcription of
target genes. Target genes activated by NF-KB include IL-2, IL-6, GM-CSF, ICAM-1 and
class 1 MHC.

Due to its central role and ability to respond to a range of stimuli, reporter constructs 
utilizing the NF-KB promoter element are used to screen the supernatants produced in 
Example 14. Activators or inhibitors of NF-KB would be useful in treating diseases. For 
example, inhibitors of NF-KB could be used to treat those diseases related to the acute or 
chronic activation of NF-KB, such as rheumatoid arthritis. 

To construct a vector containing the NF-KB promoter element, a PCR-based strategy
is employed. The upstream primer contains four tandem copies of the NF-KB binding site
(GGGGACTTTCCC) (SEQ ID NO:91), 18 bp of sequence complementary to the 5' end of
the SV40 early promoter sequence, and is flanked with an XhoI site:
5':GCGGCCTCGAGGGGACTTTCCCGGGGACTTTCCGGGGACTTTCCGGGACTTTCCATCCTGCCATCTCAATTAG:3' (SEQ ID NO:92)

The downstream primer is complementary to the 3' end of the SV40 promoter and
is flanked with a HindIII site:

5':GCGGCAAGCTTTTTGCAAAGCCTAGGC:3' (SEQ ID NO:93) 
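
A quick sanity check of the two primers can be sketched as below. The sequences are copied as printed in this text and may carry transcription errors; the recognition sites used (CTCGAG for XhoI, AAGCTT for HindIII) are the standard ones, while the 10-base core motif GGGACTTTCC and the helper name `primer_report` are assumptions made for illustration.

```python
UPSTREAM = (
    "GCGGCCTCGAGGGGACTTTCCCGGGGACTTTCCGGGGACTTTCC"
    "GGGACTTTCCATCCTGCCATCTCAATTAG"
)
DOWNSTREAM = "GCGGCAAGCTTTTTGCAAAGCCTAGGC"

XHOI_SITE = "CTCGAG"       # XhoI recognition sequence
HINDIII_SITE = "AAGCTT"    # HindIII recognition sequence
CORE_MOTIF = "GGGACTTTCC"  # assumed core of the NF-KB binding site


def primer_report(primer, site, motif=None):
    """Locate a restriction site and optionally count non-overlapping motif copies."""
    report = {"length": len(primer), "site_position": primer.find(site)}
    if motif is not None:
        report["motif_copies"] = primer.count(motif)
    return report


print(primer_report(UPSTREAM, XHOI_SITE, CORE_MOTIF))
print(primer_report(DOWNSTREAM, HINDIII_SITE))
```

A `site_position` of -1 would mean the expected restriction site is missing from the primer as transcribed, and a motif count other than four would flag a discrepancy with the four tandem copies described above.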

PCR amplification is performed using the SV40 promoter template present in the
pB-gal:promoter plasmid obtained from Clontech. The resulting PCR fragment is digested with
XhoI and HindIII and subcloned into BLSK2- (Stratagene). Sequencing with the T7 and
T3 primers confirms the insert contains the following sequence:

5':CTGGAGGGGACTTTCCGGGGGAGTTTGGGGGGACTTTCCGGGACTTTCCAT
GTGGGATCTCAATTAGTCAGGAACCATAGTCCCGGCCGTAACTCCGGGCATGG
CGGCCCTAAGTCCGGCCAGTTCCGCCCATTCTCCGCCCGATGGCTGACTAATT
TTTTTTATTTATGGAGAGGGGGAGGGCGCCTCGGCCTCTGAGGTATTCCAGAA
GTAGTGAGGAGGCTTTTTTGGAGGCCTAGGCTTTTGCAAAAAGGTT:3' (SEQ ID NO:88)

Next, replace the SV40 minimal promoter element present in the pSEAP2-promoter
plasmid (Clontech) with this NF-KB/SV40 fragment using XhoI and HindIII. However, this
vector does not contain a neomycin resistance gene, and therefore, is not preferred for
mammalian expression systems.

In order to generate stable mammalian cell lines, the NF-KB/SV40/SEAP cassette
is removed from the above NF-KB/SEAP vector using restriction enzymes SalI and NotI,
and inserted into a vector containing neomycin resistance. Particularly, the
NF-KB/SV40/SEAP cassette was inserted into pGFP-1 (Clontech), replacing the GFP gene, after
restricting pGFP-1 with SalI and NotI.

Once the NF-KB/SV40/SEAP/Neo vector is created, stable Jurkat T-cells are created and
maintained according to the protocol described in Example 16. Similarly, the method for
assaying supernatants with these stable Jurkat T-cells is also described in Example 16. As
a positive control, exogenous TNF alpha (0.1, 1, 10 ng) is added to wells H9, H10, and H11,
with a 5-10 fold activation typically observed.

Example 20: Assay for SEAP Activity 

As a reporter molecule for the assays described in Examples 16-19, SEAP activity
is assayed using the Tropix Phospho-light Kit (Cat. BP-400) according to the following
general procedure. The Tropix Phospho-light Kit supplies the Dilution, Assay, and Reaction
Buffers used below.

Prime a dispenser with the 2.5x Dilution Buffer and dispense 15 ul of 2.5x dilution 
buffer into Optiplates containing 35 ul of a supernatant. Seal the plates with a plastic sealer 
and incubate at 65°C for 30 min. Separate the Optiplates to avoid uneven heating. 

Cool the samples to room temperature for 15 minutes. Empty the dispenser and
prime with the Assay Buffer. Add 50 ul Assay Buffer and incubate at room temperature for 5
min. Empty the dispenser and prime with the Reaction Buffer (see the table below). Add
50 ul Reaction Buffer and incubate at room temperature for 20 minutes. Since the intensity
of the chemiluminescent signal is time dependent, and it takes about 10 minutes to read 5
plates on the luminometer, one should treat 5 plates at a time and start the second set 10
minutes later.
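
The staggered handling described above (sets of 5 plates, each set started 10 minutes after the previous one) can be sketched as a simple schedule. The function name and the choice of 12 plates in the example are hypothetical.

```python
def batch_start_times(n_plates, plates_per_batch=5, stagger_min=10):
    """Return a list of (start_minute, plate_numbers) for each batch of plates."""
    if n_plates < 0 or plates_per_batch <= 0:
        raise ValueError("plate counts must be non-negative and batch size positive")
    schedule = []
    for batch_index, first in enumerate(range(0, n_plates, plates_per_batch)):
        last = min(first + plates_per_batch, n_plates)
        plates = list(range(first + 1, last + 1))  # plates numbered from 1
        schedule.append((batch_index * stagger_min, plates))
    return schedule


for start, plates in batch_start_times(12):
    print(f"t = {start:2d} min: add Reaction Buffer to plates {plates}")
```

Keeping the stagger equal to the time needed to read one set means every plate is read after roughly the same incubation time.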

Read the relative light units in the luminometer. Set H12 as blank, and print the
results. An increase in chemiluminescence indicates reporter activity.

Reaction Buffer Formulation:

# of plates    Rxn buffer diluent (ml)    CSPD (ml)
10             60                         3
11             65                         3.25
12             70                         3.5
13             75                         3.75
14             80                         4
15             85                         4.25
16             90                         4.5
17             95                         4.75
18             100                        5
19             105                        5.25
20             110                        5.5
21             115                        5.75
22             120                        6
23             125                        6.25
24             130                        6.5
25             135                        6.75
26             140                        7
27             145                        7.25
28             150                        7.5
29             155                        7.75
30             160                        8
31             165                        8.25
32             170                        8.5
33             175                        8.75
34             180                        9
35             185                        9.25
36             190                        9.5
37             195                        9.75
38             200                        10
39             205                        10.25
40             210                        10.5
41             215                        10.75
42             220                        11
43             225                        11.25
44             230                        11.5
45             235                        11.75
46             240                        12
47             245                        12.25
48             250                        12.5
49             255                        12.75
50             260                        13

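
The Reaction Buffer table follows a simple linear pattern: each additional plate adds 5 ml of diluent, and the CSPD volume is always one twentieth of the diluent volume. A minimal sketch of that pattern is given below; the function name is hypothetical, and the formula is inferred from the tabulated values rather than stated by the kit manufacturer.

```python
def reaction_buffer_volumes(n_plates):
    """Return (diluent_ml, cspd_ml) for n_plates, following the table's pattern."""
    if not 10 <= n_plates <= 50:
        raise ValueError("the table covers 10 to 50 plates")
    diluent_ml = 5 * n_plates + 10  # 60 ml at 10 plates, plus 5 ml per extra plate
    cspd_ml = diluent_ml / 20       # CSPD is diluted 1:20 in every row
    return diluent_ml, cspd_ml


for plates in (10, 31, 50):
    diluent, cspd = reaction_buffer_volumes(plates)
    print(f"{plates} plates: {diluent} ml diluent, {cspd} ml CSPD")
```

The function deliberately refuses plate counts outside the tabulated range, since nothing in the text confirms that the pattern holds there.
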
Example 21: High-Throughput Screening Assay Identifying Changes in Small Molecule
Concentration and Membrane Permeability

Binding of a ligand to a receptor is known to alter intracellular levels of small
molecules, such as calcium, potassium, sodium, and pH, as well as alter membrane potential.
These alterations can be measured in an assay to identify supernatants which bind to
receptors of a particular cell. Although the following protocol describes an assay for
calcium, this protocol can easily be modified to detect changes in potassium, sodium, pH,
membrane potential, or any other small molecule which is detectable by a fluorescent probe.

The following assay uses a Fluorometric Imaging Plate Reader ("FLIPR") to measure
changes in fluorescent molecules (Molecular Probes) that bind small molecules. Clearly,
any fluorescent molecule detecting a small molecule can be used instead of the calcium
fluorescent molecule, fluo-3, used here.

For adherent cells, seed the cells at 10,000-20,000 cells/well in a Co-star black 96-well
plate with clear bottom. The plate is incubated in a CO2 incubator for 20 hours. The
adherent cells are washed two times in a Biotek washer with 200 ul of HBSS (Hank's
Balanced Salt Solution), leaving 100 ul of buffer after the final wash.

A stock solution of 1 mg/ml fluo-3 is made in 10% pluronic acid DMSO. To load
the cells with fluo-3, 50 ul of 12 ug/ml fluo-3 is added to each well. The plate is incubated
at 37°C in a CO2 incubator for 60 min. The plate is washed four times in the Biotek washer
with HBSS, leaving 100 ul of buffer.

For non-adherent cells, the cells are spun down from culture media. Cells are
re-suspended to 2-5x10^6 cells/ml with HBSS in a 50-ml conical tube. 4 ul of 1 mg/ml fluo-3
solution in 10% pluronic acid DMSO is added to each ml of cell suspension. The tube is
then placed in a 37°C water bath for 30-60 min. The cells are washed twice with HBSS,
resuspended to 1x10^6 cells/ml, and dispensed into a microplate, 100 ul/well. The plate is
centrifuged at 1000 rpm for 5 min. The plate is then washed once in Denley Cell Wash with
200 ul, followed by an aspiration step to 100 ul final volume.
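
The two loading procedures above arrive at nearly the same working fluo-3 concentration, which a short dilution calculation makes explicit. This is an illustrative sketch; the function name is hypothetical, and it assumes the dye is simply diluted into the volume already present (100 ul of buffer per well, or 1 ml of cell suspension).

```python
def final_concentration(stock_conc, stock_volume, diluent_volume):
    """C_final = C_stock * V_stock / (V_stock + V_diluent); volumes share one unit."""
    total_volume = stock_volume + diluent_volume
    if total_volume <= 0:
        raise ValueError("total volume must be positive")
    return stock_conc * stock_volume / total_volume


# Adherent cells: 50 ul of 12 ug/ml fluo-3 added to 100 ul of buffer in the well.
adherent_ug_per_ml = final_concentration(12.0, 50.0, 100.0)

# Non-adherent cells: 4 ul of 1 mg/ml (1000 ug/ml) stock per 1 ml (1000 ul) of cells.
suspension_ug_per_ml = final_concentration(1000.0, 4.0, 1000.0)

print(f"adherent: {adherent_ug_per_ml:.2f} ug/ml")
print(f"suspension: {suspension_ug_per_ml:.2f} ug/ml")
```

Under these assumptions both routes load the cells at about 4 ug/ml fluo-3.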

For a non-cell based assay, each well contains a fluorescent molecule, such as fluo-3.
The supernatant is added to the well, and a change in fluorescence is detected.

To measure the fluorescence of intracellular calcium, the FLIPR is set for the
following parameters: (1) System gain is 300-800 mW; (2) Exposure time is 0.4 second;
(3) Camera F/stop is F/2; (4) Excitation is 488 nm; (5) Emission is 530 nm; and (6) Sample
addition is 50 ul. Increased emission at 530 nm indicates an extracellular signaling event
caused by a molecule, either METH1 or METH2 or a molecule induced by METH1 or
METH2, which has resulted in an increase in the intracellular Ca++ concentration.

Example 22: High-Throughput Screening Assay Identifying Tyrosine Kinase Activity 

The Protein Tyrosine Kinases (PTK) represent a diverse group of transmembrane and
cytoplasmic kinases. Within the Receptor Protein Tyrosine Kinase (RPTK) group are
receptors for a range of mitogenic and metabolic growth factors including the PDGF, FGF,
EGF, NGF, HGF and Insulin receptor subfamilies. In addition there is a large family of
RPTKs for which the corresponding ligand is unknown. Ligands for RPTKs include mainly
secreted small proteins, but also membrane-bound and extracellular matrix proteins.

Activation of RPTK by ligands involves ligand-mediated receptor dimerization,
resulting in transphosphorylation of the receptor subunits and activation of the cytoplasmic
tyrosine kinases. The cytoplasmic tyrosine kinases include receptor associated tyrosine
kinases of the src-family (e.g., src, yes, lck, lyn, fyn) and non-receptor linked and cytosolic
protein tyrosine kinases, such as the Jak family, members of which mediate signal
transduction triggered by the cytokine superfamily of receptors (e.g., the Interleukins,
Interferons, GM-CSF, and Leptin).

Because of the wide range of known factors capable of stimulating tyrosine kinase
activity, identifying whether METH1 or METH2 or a molecule induced by METH1 or
METH2 is capable of activating tyrosine kinase signal transduction pathways is of interest.
Therefore, the following protocol is designed to identify such molecules capable of
activating the tyrosine kinase signal transduction pathways.

Seed target cells (e.g., primary keratinocytes) at a density of approximately 25,000
cells per well in 96-well Loprodyne Silent Screen Plates purchased from Nalge Nunc
(Naperville, IL). The plates are sterilized with two 30 minute rinses with 100% ethanol,
rinsed with water and dried overnight. Some plates are coated for 2 hr with 100 ul of cell
culture grade type I collagen (50 mg/ml), gelatin (2%) or polylysine (50 mg/ml), all of
which can be purchased from Sigma Chemicals (St. Louis, MO), or 10% Matrigel purchased
from Becton Dickinson (Bedford, MA), or calf serum, rinsed with PBS and stored at 4°C.
Cell growth on these plates is assayed by seeding 5,000 cells/well in growth medium and
indirect quantitation of cell number through use of alamarBlue as described by the
manufacturer Alamar Biosciences, Inc. (Sacramento, CA) after 48 hr. Falcon plate covers
#3071 from Becton Dickinson (Bedford, MA) are used to cover the Loprodyne Silent Screen
Plates. Falcon Microtest III cell culture plates can also be used in some proliferation
experiments.

To prepare extracts, A431 cells are seeded onto the nylon membranes of Loprodyne
plates (20,000/200 ul/well) and cultured overnight in complete medium. Cells are quiesced
by incubation in serum-free basal medium for 24 hr. After 5-20 minutes of treatment with EGF
(60 ng/ml) or 50 ul of the supernatant produced in Example 14, the medium is removed
and 100 ul of extraction buffer (20 mM HEPES pH 7.5, 0.15 M NaCl, 1% Triton X-100,
0.1% SDS, 2 mM Na3VO4, 2 mM Na4P2O7 and a cocktail of protease inhibitors (#
1836170) obtained from Boehringer Mannheim (Indianapolis, IN)) is added to each well and
the plate is shaken on a rotating shaker for 5 minutes at 4°C. The plate is then placed in a
vacuum transfer manifold and the extract filtered through the 0.45 um membrane bottoms
of each well using house vacuum. Extracts are collected in a 96-well catch/assay plate in
the bottom of the vacuum manifold and immediately placed on ice. To obtain extracts
clarified by centrifugation, the content of each well, after detergent solubilization for 5
minutes, is removed and centrifuged for 15 minutes at 4°C at 16,000 x g.

Test the filtered extracts for levels of tyrosine kinase activity. Although many 
methods of detecting tyrosine kinase activity are known, one method is described here. 

Generally, the tyrosine kinase activity of a supernatant is evaluated by determining
its ability to phosphorylate a tyrosine residue on a specific substrate (a biotinylated peptide).
Biotinylated peptides that can be used for this purpose include PSK1 (corresponding to
amino acids 6-20 of the cell division kinase cdc2-p34) and PSK2 (corresponding to amino
acids 1-17 of gastrin). Both peptides are substrates for a range of tyrosine kinases and are
available from Boehringer Mannheim.

The tyrosine kinase reaction is set up by adding the following components in order.
First, add 10 ul of 5 uM Biotinylated Peptide, then 10 ul ATP/Mg2+ (5 mM ATP/50 mM
MgCl2), then 10 ul of 5x Assay Buffer (40 mM imidazole hydrochloride, pH 7.3, 40 mM
beta-glycerophosphate, 1 mM EGTA, 100 mM MgCl2, 5 mM MnCl2, 0.5 mg/ml BSA), then 5 ul
of Sodium Vanadate (1 mM), and then 5 ul of water. Mix the components gently and
preincubate the reaction mix at 30°C for 2 min. Initiate the reaction by adding 10 ul of the
control enzyme or the filtered supernatant.
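
The volumes listed above add up to a 50 ul reaction, so the 5x Assay Buffer ends up at 1x. A minimal sketch of that bookkeeping follows; the component table simply restates the volumes and stock concentrations given in the text, and the function name is hypothetical.

```python
# (component, volume in ul, stock concentration, unit); None where no stock applies
COMPONENTS = [
    ("biotinylated peptide", 10, 5.0, "uM"),
    ("ATP", 10, 5.0, "mM"),
    ("assay buffer", 10, 5.0, "x"),
    ("sodium vanadate", 5, 1.0, "mM"),
    ("water", 5, None, None),
    ("enzyme or supernatant", 10, None, None),
]


def final_concentrations(components):
    """Return (total volume in ul, {component: (final concentration, unit)})."""
    total_ul = sum(volume for _, volume, _, _ in components)
    finals = {
        name: (stock * volume / total_ul, unit)
        for name, volume, stock, unit in components
        if stock is not None
    }
    return total_ul, finals


total_ul, finals = final_concentrations(COMPONENTS)
print(f"total reaction volume: {total_ul} ul")
for name, (conc, unit) in finals.items():
    print(f"{name}: {conc:g} {unit}")
```

On these figures the peptide is at 1 uM, ATP at 1 mM and vanadate at 0.1 mM in the running reaction.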
The tyrosine kinase assay reaction is then terminated by adding 10 ul of 120 mM
EDTA and placing the reactions on ice.

Tyrosine kinase activity is determined by transferring a 50 ul aliquot of the reaction
mixture to a microtiter plate (MTP) module and incubating at 37°C for 20 min. This allows
the streptavidin-coated 96-well plate to associate with the biotinylated peptide. Wash the
MTP module with 300 ul/well of PBS four times. Next add 75 ul of anti-phosphotyrosine
antibody conjugated to horseradish peroxidase (anti-P-Tyr-POD (0.5 u/ml)) to each well and
incubate at 37°C for one hour. Wash the well as above.

Next add 100 ul of peroxidase substrate solution (Boehringer Mannheim) and
incubate at room temperature for at least 5 min (up to 30 min). Measure the absorbance of
the sample at 405 nm using an ELISA reader. The level of bound peroxidase activity is
quantitated using an ELISA reader and reflects the level of tyrosine kinase activity.

Example 23: High-Throughput Screening Assay Identifying Phosphorylation Activity 

As a potential alternative and/or complement to the assay of protein tyrosine kinase
activity described in Example 22, an assay which detects activation (phosphorylation) of
major intracellular signal transduction intermediates can also be used. For example, as
described below one particular assay can detect tyrosine phosphorylation of the Erk-1 and
Erk-2 kinases. However, phosphorylation of other molecules, such as Raf, JNK, p38 MAP,
Map kinase kinase (MEK), MEK kinase, Src, Muscle specific kinase (MuSK), IRAK, Tec,
and Janus, as well as any other phosphoserine, phosphotyrosine, or phosphothreonine
molecule, can be detected by substituting these molecules for Erk-1 or Erk-2 in the
following assay.

Specifically, assay plates are made by coating the wells of a 96-well ELISA plate
with 0.1 ml of protein G (1 ug/ml) for 2 hr at room temp. (RT). The plates are then rinsed
with PBS and blocked with 3% BSA/PBS for 1 hr at RT. The protein G plates are then
treated with 2 commercial monoclonal antibodies (100 ng/well) against Erk-1 and Erk-2 (1
hr at RT) (Santa Cruz Biotechnology). (To detect other molecules, this step can easily be
modified by substituting a monoclonal antibody detecting any of the above described
molecules.) After 3-5 rinses with PBS, the plates are stored at 4°C until use.

A431 cells are seeded at 20,000/well in a 96-well Loprodyne filterplate and cultured
overnight in growth medium. The cells are then starved for 48 hr in basal medium (DMEM)
and then treated with EGF (6 ng/well) or 50 ul of the supernatants obtained in Example 14
for 5-20 minutes. The cells are then solubilized and extracts filtered directly into the assay
plate.

After incubation with the extract for 1 hr at RT, the wells are again rinsed. As a
positive control, a commercial preparation of MAP kinase (10 ng/well) is used in place of
A431 extract. Plates are then treated with a commercial polyclonal (rabbit) antibody
(1 ug/ml) which specifically recognizes the phosphorylated epitope of the Erk-1 and Erk-2
kinases (1 hr at RT). This antibody is biotinylated by standard procedures. The bound
polyclonal antibody is then quantitated by successive incubations with
Europium-streptavidin and Europium fluorescence enhancing reagent in the Wallac DELFIA
instrument (time-resolved fluorescence). An increased fluorescent signal over background
indicates a phosphorylation by METH1 or METH2 or a molecule induced by METH1 or
METH2.

Example 24: Method of Determining Alterations in the METH1 or METH2 Gene

RNA is isolated from entire families or individual patients presenting with a phenotype
of interest (such as a disease). cDNA is then generated from these RNA
samples using protocols known in the art. (See, Sambrook.) The cDNA is then used as a
template for PCR, employing primers surrounding regions of interest in SEQ ID NO:1.
Suggested PCR conditions consist of 35 cycles at 95°C for 30 seconds; 60-120 seconds at
52-58°C; and 60-120 seconds at 70°C, using buffer solutions described in Sidransky, D. et
al., Science 252:706 (1991).
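
The suggested cycling conditions imply a run time that can be estimated with a short calculation. This sketch counts only the programmed hold times at each temperature; ramp times between temperatures and any initial or final holds are not given in the text and are ignored. The function name is hypothetical.

```python
def cycling_time_minutes(cycles, step_seconds):
    """Total programmed hold time in minutes for a fixed number of PCR cycles."""
    if cycles <= 0 or any(s <= 0 for s in step_seconds):
        raise ValueError("cycles and step durations must be positive")
    return cycles * sum(step_seconds) / 60.0


# 35 cycles of: 30 s denaturation, 60-120 s annealing, 60-120 s extension.
shortest = cycling_time_minutes(35, [30, 60, 60])
longest = cycling_time_minutes(35, [30, 120, 120])
print(f"programmed hold time: {shortest} to {longest} minutes")
```

Depending on the annealing and extension times chosen, the holds alone take between roughly one and a half and a little over two and a half hours.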

PCR products are then sequenced using primers labeled at their 5' end with T4
polynucleotide kinase, employing SequiTherm Polymerase (Epicentre Technologies). The
intron-exon borders of selected exons of METH1 or METH2 are also determined and
genomic PCR products analyzed to confirm the results. PCR products harboring suspected
mutations in METH1 or METH2 are then cloned and sequenced to validate the results of the
direct sequencing.

PCR products of METH1 or METH2 are cloned into T-tailed vectors as described
in Holton, T.A. and Graham, M.W., Nucleic Acids Research 19:1156 (1991) and sequenced
with T7 polymerase (United States Biochemical). Affected individuals are identified by
mutations in METH1 or METH2 not present in unaffected individuals.
Genomic rearrangements are also observed as a method of determining alterations
in the METH1 or METH2 gene. Isolated genomic clones are nick-translated with
digoxigenin-deoxyuridine 5'-triphosphate (Boehringer Mannheim), and FISH performed as
described in Johnson, Cv. et al., Methods Cell Biol. 35:73-99 (1991). Hybridization with
the labeled probe is carried out using a vast excess of human cot-1 DNA for specific
hybridization to the METH1 or METH2 genomic locus.

Chromosomes are counterstained with 4,6-diamino-2-phenylindole and propidium
iodide, producing a combination of C- and R-bands. Aligned images for precise mapping
are obtained using a triple-band filter set (Chroma Technology, Brattleboro, VT) in
combination with a cooled charge-coupled device camera (Photometrics, Tucson, AZ) and
variable excitation wavelength filters. (Johnson, Cv. et al., Genet. Anal. Tech. Appl. 8:15
(1991).) Image collection, analysis and chromosomal fractional length measurements are
performed using the ISee Graphical Program System. (Inovision Corporation, Durham, NC.)
Chromosome alterations of the genomic region of METH1 or METH2 (hybridized by the
probe) are identified as insertions, deletions, and translocations. These METH1 or METH2
alterations are used as a diagnostic marker for an associated disease.

Example 25: Method of Detecting Abnormal Levels of METH1 or METH2 in a
Biological Sample

METH1 or METH2 polypeptides can be detected in a biological sample, and if an
increased or decreased level of METH1 or METH2 is detected, this polypeptide is a marker
for a particular phenotype. Methods of detection are numerous, and thus, it is understood
that one skilled in the art can modify the following assay to fit their particular needs.

For example, antibody-sandwich ELISAs are used to detect METH1 or METH2 in
a sample, preferably a biological sample. Wells of a microtiter plate are coated with specific
antibodies to METH1 or METH2, at a final concentration of 0.2 to 10 ug/ml. The antibodies
are either monoclonal or polyclonal and are produced by the method described in Example
13. The wells are blocked so that non-specific binding of METH1 or METH2 to the well
is reduced.

The coated wells are then incubated for > 2 hours at RT with a sample containing
METH1 or METH2. Preferably, serial dilutions of the sample should be used to validate
results. The plates are then washed three times with deionized or distilled water to remove
unbound METH1 or METH2.

Next, 50 ul of specific antibody-alkaline phosphatase conjugate, at a concentration
of 25-400 ng, is added and incubated for 2 hours at room temperature. The plates are again
washed three times with deionized or distilled water to remove unbound conjugate.

Add 75 ul of 4-methylumbelliferyl phosphate (MUP) or p-nitrophenyl phosphate
(NPP) substrate solution to each well and incubate 1 hour at room temperature. Measure the
reaction by a microtiter plate reader. Prepare a standard curve, using serial dilutions of a
control sample, and plot METH1 or METH2 polypeptide concentration on the X-axis (log
scale) and fluorescence or absorbance on the Y-axis (linear scale). Interpolate the
concentration of the METH1 or METH2 in the sample using the standard curve.
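
Interpolating from a standard curve plotted with a log-scale X-axis and a linear Y-axis can be sketched as piecewise-linear interpolation in log10(concentration). This is one simple approach, not necessarily the curve-fitting method a plate reader's software would use; the function name and the standard values in the example are hypothetical.

```python
import math


def interpolate_concentration(standards, signal):
    """Interpolate a concentration from (concentration, signal) standards.

    Interpolation is linear in signal against log10(concentration), and the
    signal is assumed to rise with concentration.
    """
    points = sorted(standards)
    for (c_lo, s_lo), (c_hi, s_hi) in zip(points, points[1:]):
        if s_lo <= signal <= s_hi:
            if s_hi == s_lo:
                return c_lo
            fraction = (signal - s_lo) / (s_hi - s_lo)
            log_c = math.log10(c_lo) + fraction * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_c
    raise ValueError("signal is outside the range of the standard curve")


# Hypothetical standards: (concentration in ng/ml, absorbance)
standards = [(1, 0.1), (10, 0.5), (100, 0.9)]
print(round(interpolate_concentration(standards, 0.7), 1))
```

Signals outside the range spanned by the standards raise an error rather than being extrapolated, which is why serial dilutions of the sample are useful for bringing a reading onto the curve.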

Example 26: Formulating a Polypeptide 

The METH1 or METH2 composition will be formulated and dosed in a fashion
consistent with good medical practice, taking into account the clinical condition of the
individual patient (especially the side effects of treatment with the METH1 or METH2
polypeptide alone), the site of delivery, the method of administration, the scheduling of
administration, and other factors known to practitioners. The "effective amount" for
purposes herein is thus determined by such considerations.

As a general proposition, the total pharmaceutically effective amount of METH1 or
METH2 administered parenterally per dose will be in the range of about 1 ug/kg/day to 10
mg/kg/day of patient body weight, although, as noted above, this will be subject to
therapeutic discretion. More preferably, this dose is at least 0.01 mg/kg/day, and most
preferably for humans between about 0.01 and 1 mg/kg/day for the hormone. If given
continuously, METH1 or METH2 is typically administered at a dose rate of about 1
ug/kg/hour to about 50 ug/kg/hour, either by 1-4 injections per day or by continuous
subcutaneous infusions, for example, using a mini-pump. An intravenous bag solution may
also be employed. The length of treatment needed to observe changes and the interval
following treatment for responses to occur appear to vary depending on the desired effect.
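
The relationship between the continuous dose rate (ug/kg/hour) and the per-day range (ug/kg/day) quoted above is a unit conversion, sketched below. This is arithmetic only; the function names and the 70 kg body weight are hypothetical, and nothing here is taken from the source beyond the quoted rates.

```python
def rate_to_daily_per_kg(rate_ug_per_kg_per_hour, hours=24):
    """Convert a continuous rate in ug/kg/hour to ug/kg/day."""
    return rate_ug_per_kg_per_hour * hours


def total_daily_amount_mg(rate_ug_per_kg_per_hour, body_weight_kg, hours=24):
    """Total amount delivered per day in mg for a given body weight."""
    return rate_to_daily_per_kg(rate_ug_per_kg_per_hour, hours) * body_weight_kg / 1000.0


for rate in (1, 50):
    per_kg = rate_to_daily_per_kg(rate)
    total = total_daily_amount_mg(rate, 70)  # 70 kg is an illustrative weight
    print(f"{rate} ug/kg/hour = {per_kg} ug/kg/day ({total} mg/day at 70 kg)")
```

The quoted infusion rates therefore correspond to 24 to 1200 ug/kg/day, which lies inside the stated overall range of about 1 ug/kg/day to 10 mg/kg/day.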

Pharmaceutical compositions containing METH1 or METH2 are administered orally,
rectally, parenterally, intracisternally, intravaginally, intraperitoneally, topically (as by
powders, ointments, gels, drops or transdermal patch), buccally, or as an oral or nasal spray.
"Pharmaceutically acceptable carrier" refers to a non-toxic solid, semisolid or liquid filler,
diluent, encapsulating material or formulation auxiliary of any type. The term "parenteral"
as used herein refers to modes of administration which include intravenous, intramuscular,
intraperitoneal, intrasternal, subcutaneous and intraarticular injection and infusion.

METH1 or METH2 is also suitably administered by sustained-release systems.
Suitable examples of sustained-release compositions include semi-permeable polymer
matrices in the form of shaped articles, e.g., films, or microcapsules. Sustained-release
matrices include polylactides (U.S. Pat. No. 3,773,919, EP 58,481), copolymers of
L-glutamic acid and gamma-ethyl-L-glutamate (Sidman, U. et al., Biopolymers 22:547-556
(1983)), poly (2-hydroxyethyl methacrylate) (R. Langer et al., J. Biomed. Mater. Res.
15:167-277 (1981), and R. Langer, Chem. Tech. 12:98-105 (1982)), ethylene vinyl acetate
(R. Langer et al.) or poly-D-(-)-3-hydroxybutyric acid (EP 133,988). Sustained-release
compositions also include liposomally entrapped METH1 or METH2 polypeptides.
Liposomes containing the METH1 or METH2 are prepared by methods known per se: DE
3,218,121; Epstein et al., Proc. Natl. Acad. Sci. USA 82:3688-3692 (1985); Hwang et al.,
Proc. Natl. Acad. Sci. USA 77:4030-4034 (1980); EP 52,322; EP 36,676; EP 88,046; EP
143,949; EP 142,641; Japanese Pat. Appl. 83-118008; U.S. Pat. Nos. 4,485,045 and
4,544,545; and EP 102,324. Ordinarily, the liposomes are of the small (about 200-800
Angstroms) unilamellar type in which the lipid content is greater than about 30 mol. percent
cholesterol, the selected proportion being adjusted for the optimal secreted polypeptide
therapy.

For parenteral administration, in one embodiment, METH1 or METH2 is formulated
generally by mixing it at the desired degree of purity, in a unit dosage injectable form
(solution, suspension, or emulsion), with a pharmaceutically acceptable carrier, i.e., one that
is non-toxic to recipients at the dosages and concentrations employed and is compatible with
other ingredients of the formulation. For example, the formulation preferably does not
include oxidizing agents and other compounds that are known to be deleterious to
polypeptides.

Generally, the formulations are prepared by contacting METH1 or METH2
uniformly and intimately with liquid carriers or finely divided solid carriers or both. Then,
if necessary, the product is shaped into the desired formulation. Preferably the carrier is a
parenteral carrier, more preferably a solution that is isotonic with the blood of the recipient.
Examples of such carrier vehicles include water, saline, Ringer's solution, and dextrose
solution. Non-aqueous vehicles such as fixed oils and ethyl oleate are also useful herein, as
well as liposomes.

The carrier suitably contains minor amounts of additives such as substances that 
enhance isotonicity and chemical stability. Such materials are non-toxic to recipients at the 
dosages and concentrations employed, and include buffers such as phosphate, citrate, 
succinate, acetic acid, and other organic acids or their salts; antioxidants such as ascorbic 
acid; low molecular weight (less than about ten residues) polypeptides, e.g., polyarginine or 
tripeptides; proteins, such as serum albumin, gelatin, or immunoglobulins; hydrophilic 
polymers such as polyvinylpyrrolidone; amino acids, such as glycine, glutamic acid, aspartic 
acid, or arginine; monosaccharides, disaccharides, and other carbohydrates including 
cellulose or its derivatives, glucose, mannose, or dextrins; chelating agents such as EDTA;
sugar alcohols such as mannitol or sorbitol; counterions such as sodium; and/or nonionic 
surfactants such as polysorbates, poloxamers, or PEG. 

METH1 or METH2 is typically formulated in such vehicles at a concentration of
about 0.1 mg/ml to 100 mg/ml, preferably 1-10 mg/ml, at a pH of about 3 to 8. It will be
understood that the use of certain of the foregoing excipients, carriers, or stabilizers will
result in the formation of polypeptide salts.

METH1 or METH2 used for therapeutic administration can be sterile. Sterility is
readily accomplished by filtration through sterile filtration membranes (e.g., 0.2 micron 
membranes). Therapeutic polypeptide compositions generally are placed into a container 
having a sterile access port, for example, an intravenous solution bag or vial having a 
stopper pierceable by a hypodermic injection needle. 

METH1 or METH2 polypeptides ordinarily will be stored in unit or multi-dose
containers, for example, sealed ampoules or vials, as an aqueous solution or as a lyophilized
formulation for reconstitution. As an example of a lyophilized formulation, 10-ml vials are
filled with 5 ml of sterile-filtered 1% (w/v) aqueous METH1 or METH2 polypeptide
solution, and the resulting mixture is lyophilized. The infusion solution is prepared by
reconstituting the lyophilized METH1 or METH2 polypeptide using bacteriostatic
Water-for-Injection.
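
The lyophilized-vial example can be checked with a short calculation: a 1% (w/v) solution contains 1 g per 100 ml, i.e. 10 mg/ml. The function names and the 5 ml reconstitution volume below are hypothetical, since the text does not state how much Water-for-Injection is used.

```python
def polypeptide_mg_per_vial(fill_volume_ml, percent_w_v):
    """Mass of polypeptide per vial; 1% (w/v) equals 10 mg/ml."""
    return fill_volume_ml * percent_w_v * 10.0


def reconstituted_mg_per_ml(mass_mg, reconstitution_volume_ml):
    """Concentration after reconstituting the lyophilized cake."""
    if reconstitution_volume_ml <= 0:
        raise ValueError("reconstitution volume must be positive")
    return mass_mg / reconstitution_volume_ml


mass = polypeptide_mg_per_vial(5, 1)  # 5 ml of 1% (w/v) solution per vial
print(f"{mass} mg per vial")
print(f"{reconstituted_mg_per_ml(mass, 5)} mg/ml if reconstituted in 5 ml (assumed volume)")
```

Each vial in the example therefore holds 50 mg of polypeptide before reconstitution.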
The invention also provides a pharmaceutical pack or kit comprising one or more
containers filled with one or more of the ingredients of the pharmaceutical compositions of
the invention. Associated with such container(s) can be a notice in the form prescribed by
a governmental agency regulating the manufacture, use or sale of pharmaceuticals or
biological products, which notice reflects approval by the agency of manufacture, use or sale
for human administration. In addition, METH1 or METH2 may be employed in conjunction
with other therapeutic compounds.

The compositions of the invention may be administered alone or in combination with
other therapeutic agents. Therapeutic agents that may be administered in combination with
the compositions of the invention include, but are not limited to, other members of the TNF
family, chemotherapeutic agents, antibiotics, steroidal and non-steroidal anti-inflammatories,
conventional immunotherapeutic agents, cytokines and/or growth factors. Combinations may
be administered either concomitantly, e.g. as an admixture; separately but simultaneously
or concurrently; or sequentially. This includes presentations in which the combined agents 
are administered together as a therapeutic mixture, and also procedures in which the 
combined agents are administered separately but simultaneously, e.g. as through separate 
intravenous lines into the same individual. Administration "in combination" further includes 
the separate administration of one of the compounds or agents given first, followed by the 
second. 

In one embodiment, the compositions of the invention are administered in
combination with other members of the TNF family. TNF, TNF-related or TNF-like
molecules that may be administered with the compositions of the invention include, but are
not limited to, soluble forms of TNF-alpha, lymphotoxin alpha (LT-alpha, also known as
TNF-beta), LT-beta (found in complex heterotrimer LT-alpha2-beta), OPGL, FasL, CD27L,
CD30L, CD40L, 4-1BBL, DcR3, OX40L, TNF-gamma (International Publication No. WO
96/14328), AIM-I (International Publication No. WO 97/33899), endokine-alpha
(International Publication No. WO 98/07880), TR6 (International Publication No. WO
98/30694), OPG, and neutrokine-alpha (International Publication No. WO 98/18921),
OX40, and nerve growth factor (NGF) and soluble forms of Fas, CD30, CD27, CD40 and
4-1BB, TR2 (International Publication No. WO 98/34095), DR3 (International Publication
No. WO 97/33904), DR4 (International Publication No. WO 98/32856), TR5 (International
Publication No. WO 98/30693), TR6 (International Publication No. WO 98/30694), TR7
(International Publication No. WO 98/41629), TRANK, TR9 (International Publication No.
WO 98/56892), TR10 (International Publication No. WO 98/54202), 312C2 (International
Publication No. WO 98/06842), and TR12, and soluble forms of CD154, CD70 and CD153.

Conventional nonspecific immunosuppressive agents that may be administered in 
combination with the compositions of the invention include, but are not limited to, steroids, 
cyclosporine, cyclosporine analogs, cyclophosphamide, methylprednisone, prednisone,
azathioprine, FK-506, 15-deoxyspergualin, and other immunosuppressive agents that act by
suppressing the function of responding T cells. 

In a further embodiment, the compositions of the invention are administered in 
combination with an antibiotic agent. Antibiotic agents that may be administered with the 
compositions of the invention include, but are not limited to, tetracycline, metronidazole, 
amoxicillin, beta-lactamases, aminoglycosides, macrolides, quinolones, fluoroquinolones, 
cephalosporins, erythromycin, ciprofloxacin, and streptomycin. 

In an additional embodiment, the compositions of the invention are administered 
alone or in combination with an anti-inflammatory agent. Anti-inflammatory agents that 
may be administered with the compositions of the invention include, but are not limited to, 
glucocorticoids and the nonsteroidal anti-inflammatories, aminoarylcarboxylic acid 
derivatives, arylacetic acid derivatives, arylbutyric acid derivatives, arylcarboxylic acids, 
arylpropionic acid derivatives, pyrazoles, pyrazolones, salicylic acid derivatives, 
thiazinecarboxamides, ε-acetamidocaproic acid, S-adenosylmethionine,
3-amino-4-hydroxybutyric acid, amixetrine, bendazac, benzydamine, bucolome, difenpiramide, ditazol,
emorfazone, guaiazulene, nabumetone, nimesulide, orgotein, oxaceprol, paranyline, 
perisoxal, pifoxime, proquazone, proxazole, and tenidap. 

In another embodiment, compositions of the invention are administered in 
combination with a chemotherapeutic agent. Chemotherapeutic agents that may be 
administered with the compositions of the invention include, but are not limited to, 
antibiotic derivatives (e.g., doxorubicin, bleomycin, daunorubicin, and dactinomycin); 
antiestrogens (e.g., tamoxifen); antimetabolites (e.g., fluorouracil, 5-FU, methotrexate, 
floxuridine, interferon alpha-2b, glutamic acid, plicamycin, mercaptopurine, and 6- 
thioguanine); cytotoxic agents (e.g., carmustine, BCNU, lomustine, CCNU, cytosine 
arabinoside, cyclophosphamide, estramustine, hydroxyurea, procarbazine, mitomycin, 
busulfan, cis-platin, and vincristine sulfate); hormones (e.g., medroxyprogesterone,
estramustine phosphate sodium, ethinyl estradiol, estradiol, megestrol acetate,
methyltestosterone, diethylstilbestrol diphosphate, chlorotrianisene, and testolactone);
nitrogen mustard derivatives (e.g., melphalan, chlorambucil, mechlorethamine (nitrogen
mustard) and thiotepa); steroids and combinations (e.g., betamethasone sodium phosphate);
and others (e.g., dacarbazine, asparaginase, mitotane, vincristine sulfate, vinblastine sulfate,
and etoposide).

In an additional embodiment, the compositions of the invention are administered in 
combination with cytokines. Cytokines that may be administered with the compositions of 
the invention include, but are not limited to, IL2, IL3, IL4, IL5, IL6, IL7, IL10, IL12, IL13,
IL15, anti-CD40, CD40L, IFN-gamma and TNF-alpha.

In an additional embodiment, the compositions of the invention are administered in 
combination with angiogenic proteins. Angiogenic proteins that may be administered with 
the compositions of the invention include, but are not limited to, Glioma Derived Growth 
Factor (GDGF), as disclosed in European Patent Number EP-399816; Platelet Derived
Growth Factor-A (PDGF-A), as disclosed in European Patent Number EP-682110; Platelet
Derived Growth Factor-B (PDGF-B), as disclosed in European Patent Number EP-282317;
Placental Growth Factor (PlGF), as disclosed in International Publication Number WO
92/06194; Placental Growth Factor-2 (PlGF-2), as disclosed in Hauser et al., Growth
Factors, 4:259-268 (1993); Vascular Endothelial Growth Factor (VEGF), as disclosed in
International Publication Number WO 90/13649; Vascular Endothelial Growth Factor-A
(VEGF-A), as disclosed in European Patent Number EP-506477; Vascular Endothelial
Growth Factor-2 (VEGF-2), as disclosed in International Publication Number WO
96/39515; Vascular Endothelial Growth Factor B-186 (VEGF-B186), as disclosed in
International Publication Number WO 96/26736; Vascular Endothelial Growth Factor-D
(VEGF-D), as disclosed in International Publication Number WO 98/02543; Vascular
Endothelial Growth Factor-D (VEGF-D), as disclosed in International Publication Number
WO 98/07832; and Vascular Endothelial Growth Factor-E (VEGF-E), as disclosed in
German Patent Number DE19639601. The above-mentioned references are incorporated
herein by reference.

In an additional embodiment, the compositions of the invention are administered in
combination with Fibroblast Growth Factors. Fibroblast Growth Factors that may be
administered with the compositions of the invention include, but are not limited to, FGF-1,
FGF-2, FGF-3, FGF-4, FGF-5, FGF-6, FGF-7, FGF-8, FGF-9, FGF-10, FGF-11, FGF-12,
FGF-13, FGF-14, and FGF-15.

In additional embodiments, the compositions of the invention are administered in 
combination with other therapeutic or prophylactic regimens, such as, for example, radiation
therapy.

Example 27: Method of Treating Decreased Levels of METH1 or METH2

The present invention relates to a method for treating an individual in need of a
decreased level of METH1 or METH2 activity in the body comprising administering to
such an individual a composition comprising a therapeutically effective amount of a METH1
or METH2 antagonist. Preferred antagonists for use in the present invention are METH1-
or METH2-specific antibodies.

Moreover, it will be appreciated that conditions caused by a decrease in the standard
or normal expression level of METH1 or METH2 in an individual can be treated by
administering METH1 or METH2, preferably in the secreted form. Thus, the invention also
provides a method of treatment of an individual in need of an increased level of METH1 or
METH2 polypeptide comprising administering to such an individual a pharmaceutical
composition comprising an amount of METH1 or METH2 to increase the activity level of
METH1 or METH2 in such an individual.

For example, a patient with decreased levels of METH1 or METH2 polypeptide
receives a daily dose of 0.1-100 µg/kg of the polypeptide for six consecutive days. Preferably,
the polypeptide is in the secreted form. The exact details of the dosing scheme, based on
administration and formulation, are provided in Example 26.
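
The per-kilogram range quoted above can be converted into absolute amounts with simple arithmetic. The following is a minimal sketch only, not a dosing tool: the function names and the 70 kg body weight used in the usage note are illustrative assumptions, while the 0.1-100 µg/kg range and the six-day course are taken from the text.

```python
from typing import Tuple

# Values taken from the text: 0.1-100 micrograms per kg per day, six consecutive days.
DOSE_RANGE_UG_PER_KG = (0.1, 100.0)
TREATMENT_DAYS = 6


def daily_dose_ug(weight_kg: float) -> Tuple[float, float]:
    """Return the (low, high) daily dose in micrograms for a body weight in kg."""
    if weight_kg <= 0:
        raise ValueError("weight_kg must be positive")
    low, high = DOSE_RANGE_UG_PER_KG
    return low * weight_kg, high * weight_kg


def course_total_ug(weight_kg: float) -> Tuple[float, float]:
    """Return the (low, high) cumulative dose in micrograms over the full course."""
    low, high = daily_dose_ug(weight_kg)
    return low * TREATMENT_DAYS, high * TREATMENT_DAYS
```

For a hypothetical 70 kg patient, `daily_dose_ug(70.0)` spans roughly 7 µg to 7 mg per day, which illustrates how wide the stated range is before the formulation details of Example 26 are applied.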

Example 28: Method of Treating Increased Levels of METH1 or METH2

The present invention also relates to a method for treating an individual in need of
an increased level of METH1 or METH2 activity in the body comprising administering to
such an individual a composition comprising a therapeutically effective amount of METH1
or METH2 or an agonist thereof.

Antisense technology is used to inhibit production of METH1 or METH2. This
technology is one example of a method of decreasing levels of METH1 or METH2
polypeptide, preferably a secreted form, due to a variety of etiologies, such as cancer.

For example, a patient diagnosed with abnormally increased levels of METH1 or
METH2 is administered intravenously antisense polynucleotides at 0.5, 1.0, 1.5, 2.0 and 3.0
mg/kg/day for 21 days. This treatment is repeated after a 7-day rest period if the treatment
was well tolerated. The formulation of the antisense polynucleotide is provided in Example
26.

Example 29: Method of Treatment Using Gene Therapy - Ex Vivo 

One method of gene therapy transplants fibroblasts, which are capable of expressing
METH1 or METH2 polypeptides, onto a patient. Generally, fibroblasts are obtained from
a subject by skin biopsy. The resulting tissue is placed in tissue-culture medium and
separated into small pieces. Small chunks of the tissue are placed on a wet surface of a
tissue culture flask; approximately ten pieces are placed in each flask. The flask is turned
upside down, closed tight and left at room temperature overnight. After 24 hours at room
temperature, the flask is inverted, the chunks of tissue remain fixed to the bottom of the
flask, and fresh media (e.g., Ham's F12 media, with 10% FBS, penicillin and streptomycin)
is added. The flasks are then incubated at 37 °C for approximately one week.

At this time, fresh media is added and subsequently changed every several days.
After an additional two weeks in culture, a monolayer of fibroblasts emerges. The monolayer
is trypsinized and scaled into larger flasks.

pMV-7 (Kirschmeier, P.T. et al., DNA 7:219-25 (1988)), flanked by the long
terminal repeats of the Moloney murine sarcoma virus, is digested with EcoRI and HindIII
and subsequently treated with calf intestinal phosphatase. The linear vector is fractionated
on agarose gel and purified, using glass beads.

The cDNA encoding METH1 or METH2 can be amplified using PCR primers which
correspond to the 5' and 3' end sequences respectively as set forth in Example 5. Preferably,
the 5' primer contains an EcoRI site and the 3' primer includes a HindIII site. Equal
quantities of the Moloney murine sarcoma virus linear backbone and the amplified EcoRI
and HindIII fragment are added together, in the presence of T4 DNA ligase. The resulting
mixture is maintained under conditions appropriate for ligation of the two fragments. The
ligation mixture is then used to transform bacteria HB101, which are then plated onto agar
containing kanamycin for the purpose of confirming that the vector contains properly
inserted METH1 or METH2.

The amphotropic pA317 or GP+am12 packaging cells are grown in tissue culture to
confluent density in Dulbecco's Modified Eagles Medium (DMEM) with 10% calf serum
(CS), penicillin and streptomycin. The MSV vector containing the METH1 or METH2 gene
is then added to the media and the packaging cells transduced with the vector. The
packaging cells now produce infectious viral particles containing the METH1 or METH2
gene (the packaging cells are now referred to as producer cells).

Fresh media is added to the transduced producer cells, and subsequently, the media
is harvested from a 10 cm plate of confluent producer cells. The spent media, containing
the infectious viral particles, is filtered through a Millipore filter to remove detached
producer cells and this media is then used to infect fibroblast cells. Media is removed from
a sub-confluent plate of fibroblasts and quickly replaced with the media from the producer
cells. This media is removed and replaced with fresh media. If the titer of virus is high,
then virtually all fibroblasts will be infected and no selection is required. If the titer is very
low, then it is necessary to use a retroviral vector that has a selectable marker, such as neo
or his. Once the fibroblasts have been efficiently infected, the fibroblasts are analyzed to
determine whether METH1 or METH2 protein is produced.

The engineered fibroblasts are then transplanted onto the host, either alone or after 
having been grown to confluence on cytodex 3 microcarrier beads. 

Example 30: Method of Treatment Using Gene Therapy - In Vivo 

Another aspect of the present invention is using in vivo gene therapy methods to
treat disorders, diseases and conditions. The gene therapy method relates to the introduction
of naked nucleic acid (DNA, RNA, and antisense DNA or RNA) METH1 or METH2
sequences into an animal to increase or decrease the expression of the METH1 or METH2
polypeptide. The METH1 or METH2 polynucleotide may be operatively linked to a
promoter or any other genetic elements necessary for the expression of the METH1 or
METH2 polypeptide by the target tissue. Such gene therapy and delivery techniques and
methods are known in the art, see, for example, WO 90/11092, WO 98/11779; U.S. Patent
Nos. 5,693,622, 5,705,151, 5,580,859; Tabata, H. et al. (1997) Cardiovasc. Res. 35(3):470-
479, Chao, J. et al. (1997) Pharmacol. Res. 35(6):517-522, Wolff, J.A. (1997) Neuromuscul.
Disord. 7(5):314-318, Schwartz, B. et al. (1996) Gene Ther. 3(5):405-411, Tsurumi, Y. et
al. (1996) Circulation 94(12):3281-3290 (incorporated herein by reference).

The METH1 or METH2 polynucleotide constructs may be delivered by any method
that delivers injectable materials to the cells of an animal, such as injection into the
interstitial space of tissues (heart, muscle, skin, lung, liver, intestine and the like). The
METH1 or METH2 polynucleotide constructs can be delivered in a pharmaceutically
acceptable liquid or aqueous carrier.

The term "naked" polynucleotide, DNA or RNA, refers to sequences that are free
from any delivery vehicle that acts to assist, promote, or facilitate entry into the cell,
including viral sequences, viral particles, liposome formulations, lipofectin or precipitating
agents and the like. However, the METH1 or METH2 polynucleotides may also be
delivered in liposome formulations (such as those taught in Felgner, P.L. et al. (1995) Ann.
NY Acad. Sci. 772:126-139 and Abdallah, B. et al. (1995) Biol. Cell 85(1):1-7) which can
be prepared by methods well known to those skilled in the art.

The METH1 or METH2 polynucleotide vector constructs used in the gene therapy
method are preferably constructs that will not integrate into the host genome nor will they
contain sequences that allow for replication. Any strong promoter known to those skilled
in the art can be used for driving the expression of DNA. Unlike other gene therapy
techniques, one major advantage of introducing naked nucleic acid sequences into target
cells is the transitory nature of the polynucleotide synthesis in the cells. Studies have shown
that non-replicating DNA sequences can be introduced into cells to provide production of
the desired polypeptide for periods of up to six months.

The METH1 or METH2 polynucleotide construct can be delivered to the interstitial
space of tissues within an animal, including muscle, skin, brain, lung, liver, spleen,
bone marrow, thymus, heart, lymph, blood, bone, cartilage, pancreas, kidney, gall bladder,
stomach, intestine, testis, ovary, uterus, rectum, nervous system, eye, gland, and connective
tissue. Interstitial space of the tissues comprises the intercellular fluid, mucopolysaccharide
matrix among the reticular fibers of organ tissues, elastic fibers in the walls of vessels or
chambers, collagen fibers of fibrous tissues, or that same matrix within connective tissue
ensheathing muscle cells or in the lacunae of bone. It is similarly the space occupied by the
plasma of the circulation and the lymph fluid of the lymphatic channels. Delivery to the
interstitial space of muscle tissue is preferred for the reasons discussed below. The constructs
may be conveniently delivered by injection into the tissues comprising these cells. They are
preferably delivered to and expressed in persistent, non-dividing cells which are
differentiated, although delivery and expression may be achieved in non-differentiated or
less completely differentiated cells, such as, for example, stem cells of blood or skin
fibroblasts. In vivo muscle cells are particularly competent in their ability to take up and
express polynucleotides.

For the naked METH1 or METH2 polynucleotide injection, an effective dosage
amount of DNA or RNA will be in the range of from about 0.0005 mg/kg body weight to
about 50 mg/kg body weight. Preferably the dosage will be from about 0.005 mg/kg to
about 20 mg/kg and more preferably from about 0.05 mg/kg to about 5 mg/kg. Of course,
as the artisan of ordinary skill will appreciate, this dosage will vary according to the tissue
site of injection. The appropriate and effective dosage of nucleic acid sequence can readily
be determined by those of ordinary skill in the art and may depend on the condition being
treated and the route of administration. The preferred route of administration is by the
parenteral route of injection into the interstitial space of tissues. However, other parenteral
routes may also be used, such as inhalation of an aerosol formulation, particularly for
delivery to lungs or bronchial tissues, throat or mucous membranes of the nose. In addition,
naked METH1 or METH2 polynucleotide constructs can be delivered to arteries during
angioplasty by the catheter used in the procedure.
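
The three nested dosage ranges above can be expressed as a small lookup. This is a sketch under stated assumptions rather than part of the disclosed method: it reads every bound, including the lowest one, as mg/kg body weight, and the tier names and the `classify_dose` helper are hypothetical labels introduced here for illustration.

```python
from typing import Dict, Tuple

# Ranges in mg per kg body weight, listed from broadest to narrowest.
# Assumption: the lowest bound is read as 0.0005 mg/kg so that the ranges nest.
DOSE_TIERS_MG_PER_KG: Dict[str, Tuple[float, float]] = {
    "effective": (0.0005, 50.0),
    "preferred": (0.005, 20.0),
    "more_preferred": (0.05, 5.0),
}


def classify_dose(dose_mg_per_kg: float) -> str:
    """Return the narrowest tier containing the dose, or 'outside' if none does."""
    narrowest = "outside"
    # Tiers are nested and ordered broadest first, so the last match is the narrowest.
    for name, (low, high) in DOSE_TIERS_MG_PER_KG.items():
        if low <= dose_mg_per_kg <= high:
            narrowest = name
    return narrowest
```

A dose of 1 mg/kg falls in the narrowest range, 10 mg/kg only in the two broader ones, and 100 mg/kg in none of them.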

The dose response effects of injected METH1 or METH2 polynucleotide in muscle
in vivo are determined as follows. Suitable METH1 or METH2 template DNA for production
of mRNA coding for METH1 or METH2 polypeptide is prepared in accordance with a
standard recombinant DNA methodology. The template DNA, which may be either circular
or linear, is either used as naked DNA or complexed with liposomes. The quadriceps
muscles of mice are then injected with various amounts of the template DNA.

Five- to six-week-old female and male Balb/C mice are anesthetized by intraperitoneal
injection with 0.3 ml of 2.5% Avertin. A 1.5 cm incision is made on the anterior thigh, and
the quadriceps muscle is directly visualized. The METH1 or METH2 template DNA is
injected in 0.1 ml of carrier in a 1 cc syringe through a 27 gauge needle over one minute,
approximately 0.5 cm from the distal insertion site of the muscle into the knee and about 0.2
cm deep. A suture is placed over the injection site for future localization, and the skin is
closed with stainless steel clips.

After an appropriate incubation time (e.g., 7 days) muscle extracts are prepared by
excising the entire quadriceps. Every fifth 15 µm cross-section of the individual quadriceps
muscles is histochemically stained for METH1 or METH2 protein expression. A time
course for METH1 or METH2 protein expression may be done in a similar fashion except
that quadriceps from different mice are harvested at different times. Persistence of METH1
or METH2 DNA in muscle following injection may be determined by Southern blot analysis
after preparing total cellular DNA and HIRT supernatants from injected and control mice.
The results of the above experimentation in mice can be used to extrapolate proper dosages
and other treatment parameters in humans and other animals using METH1 or METH2
naked DNA.



Example 31: Gene Therapy Using Endogenous METH1 and/or METH2 Gene

Another method of gene therapy according to the present invention involves operably
associating the endogenous METH1 and/or METH2 sequence with a promoter via
homologous recombination as described, for example, in U.S. Patent No. 5,641,670, issued
June 24, 1997; International Publication No. WO 96/29411, published September 26, 1996;
International Publication No. WO 94/12650, published August 4, 1994; Koller et al., Proc.
Natl. Acad. Sci. USA 86:8932-8935 (1989); and Zijlstra et al., Nature 342:435-438 (1989).
This method involves the activation of a gene which is present in the target cells, but which
is not expressed in the cells, or is expressed at a lower level than desired.

Polynucleotide constructs are made which contain a promoter and targeting
sequences, which are homologous to the 5' non-coding sequence of endogenous METH1
and/or METH2, flanking the promoter. The targeting sequence will be sufficiently near the
5' end of METH1 and/or METH2 so the promoter will be operably linked to the endogenous
sequence upon homologous recombination. The promoter and the targeting sequences can
be amplified using PCR. Preferably, the amplified promoter contains distinct restriction
enzyme sites on the 5' and 3' ends. Preferably, the 3' end of the first targeting sequence
contains the same restriction enzyme site as the 5' end of the amplified promoter and the 5'
end of the second targeting sequence contains the same restriction site as the 3' end of the
amplified promoter.

The amplified promoter and the amplified targeting sequences are digested with the
appropriate restriction enzymes and subsequently treated with calf intestinal phosphatase.
The digested promoter and digested targeting sequences are added together in the presence
of T4 DNA ligase. The resulting mixture is maintained under conditions appropriate for
ligation of the two fragments. The construct is size fractionated on an agarose gel, then
purified by phenol extraction and ethanol precipitation.

In this Example, the polynucleotide constructs are administered as naked
polynucleotides via electroporation. However, the polynucleotide constructs may also be
administered with transfection-facilitating agents, such as liposomes, viral sequences, viral
particles, precipitating agents, etc. Such methods of delivery are known in the art.

Once the cells are transfected, homologous recombination will take place which
results in the promoter being operably linked to the endogenous METH1 and/or METH2
sequence. This results in the expression of METH1 and/or METH2 in the cell. Expression
may be detected by immunological staining, or any other method known in the art.

Fibroblasts are obtained from a subject by skin biopsy. The resulting tissue is placed
in DMEM + 10% fetal calf serum. Exponentially growing or early stationary phase
fibroblasts are trypsinized and rinsed from the plastic surface with nutrient medium. An
aliquot of the cell suspension is removed for counting, and the remaining cells are subjected
to centrifugation. The supernatant is aspirated and the pellet is resuspended in 5 ml of
electroporation buffer (20 mM HEPES pH 7.3, 137 mM NaCl, 5 mM KCl, 0.7 mM
Na2HPO4, 6 mM dextrose). The cells are recentrifuged, the supernatant aspirated, and the cells
resuspended in electroporation buffer containing 1 mg/ml acetylated bovine serum albumin.
The final cell suspension contains approximately 3 × 10^6 cells/ml. Electroporation should be
performed immediately following resuspension.

Plasmid DNA is prepared according to standard techniques. For example, to
construct a plasmid for targeting to the METH1 and/or METH2 locus, plasmid pUC18 (MBI
Fermentas, Amherst, NY) is digested with HindIII. The CMV promoter is amplified by
PCR with an XbaI site on the 5' end and a BamHI site on the 3' end. Two METH1 and/or
METH2 non-coding sequences are amplified via PCR: one METH1 and/or METH2 non-
coding sequence (METH1 and/or METH2 fragment 1) is amplified with a HindIII site at the
5' end and an XbaI site at the 3' end; the other METH1 and/or METH2 non-coding sequence
(METH1 and/or METH2 fragment 2) is amplified with a BamHI site at the 5' end and a
HindIII site at the 3' end. The CMV promoter and METH1 and/or METH2 fragments are
digested with the appropriate enzymes (CMV promoter - XbaI and BamHI; METH1 and/or
METH2 fragment 1 - XbaI; METH1 and/or METH2 fragment 2 - BamHI) and ligated
together. The resulting ligation product is digested with HindIII, and ligated with the
HindIII-digested pUC18 plasmid.
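
The order in which the three amplified pieces join follows from their matching restriction sites. The sketch below models that bookkeeping only; it is an illustration under simplifying assumptions (each fragment is reduced to the names of its two end sites, and ligation is treated as a check that adjacent sites match), and the `Fragment`, `assemble` and `fits_vector` names are hypothetical.

```python
from typing import List, NamedTuple


class Fragment(NamedTuple):
    name: str
    five_prime_site: str
    three_prime_site: str


# End sites as described in the text for the three PCR products.
FRAGMENTS = [
    Fragment("fragment 1", "HindIII", "XbaI"),
    Fragment("CMV promoter", "XbaI", "BamHI"),
    Fragment("fragment 2", "BamHI", "HindIII"),
]


def assemble(fragments: List[Fragment]) -> Fragment:
    """Join fragments in order, requiring each junction to share one restriction site."""
    for left, right in zip(fragments, fragments[1:]):
        if left.three_prime_site != right.five_prime_site:
            raise ValueError(f"{left.name} and {right.name} have incompatible ends")
    joined_name = " + ".join(f.name for f in fragments)
    return Fragment(joined_name, fragments[0].five_prime_site, fragments[-1].three_prime_site)


def fits_vector(insert: Fragment, vector_site: str) -> bool:
    """True if both ends of the insert carry the site used to open the vector."""
    return insert.five_prime_site == vector_site == insert.three_prime_site
```

Assembling `FRAGMENTS` in the listed order gives an insert with a HindIII site at each end, which is why the ligation product can be cut with HindIII and cloned into the HindIII-digested pUC18. The model does not track insert orientation, which a single-enzyme cloning step leaves undetermined.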

Plasmid DNA is added to a sterile cuvette with a 0.4 cm electrode gap (Bio-Rad).
The final DNA concentration is generally at least 120 µg/ml. 0.5 ml of the cell suspension
(containing approximately 1.5 × 10^6 cells) is then added to the cuvette, and the cell
suspension and DNA solutions are gently mixed. Electroporation is performed with a
Gene-Pulser apparatus (Bio-Rad). Capacitance and voltage are set at 960 µF and 250-300
V, respectively. As voltage increases, cell survival decreases, but the percentage of surviving
cells that stably incorporate the introduced DNA into their genome increases dramatically.
Given these parameters, a pulse time of approximately 14-20 ms should be observed.
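
The electroporation settings above can be related by two back-of-the-envelope quantities. This is a rough sketch, not an instrument specification: it reads the capacitance as 960 µF, takes field strength as voltage divided by electrode gap, and assumes the pulse is a simple exponential decay whose time constant equals resistance times capacitance. The function names are illustrative.

```python
GAP_CM = 0.4            # electrode gap of the cuvette, from the text
CAPACITANCE_F = 960e-6  # 960 microfarads, as read from the text


def field_strength_v_per_cm(voltage_v: float, gap_cm: float = GAP_CM) -> float:
    """Nominal field strength across the cuvette: voltage divided by electrode gap."""
    return voltage_v / gap_cm


def implied_resistance_ohm(time_constant_s: float,
                           capacitance_f: float = CAPACITANCE_F) -> float:
    """Sample resistance implied by tau = R * C (assumes exponential-decay pulse)."""
    return time_constant_s / capacitance_f
```

Under these assumptions, 250-300 V corresponds to about 625-750 V/cm, and a 14-20 ms pulse time implies a sample resistance of roughly 15-21 ohms.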

Electroporated cells are maintained at room temperature for approximately 5 min,
and the contents of the cuvette are then gently removed with a sterile transfer pipette. The
cells are added directly to 10 ml of prewarmed nutrient media (DMEM with 15% calf
serum) in a 10 cm dish and incubated at 37 °C. The following day, the media is aspirated and
replaced with 10 ml of fresh media and incubated for a further 16-24 hours.

The engineered fibroblasts are then injected into the host, either alone or after having 
been grown to confluence on cytodex 3 microcarrier beads. The fibroblasts now produce 
the protein product. The fibroblasts can then be introduced into a patient as described above. 



Example 32: METH1 and/or METH2 Transgenic Animals

The METH1 and/or METH2 polypeptides can also be expressed in transgenic
animals. Animals of any species, including, but not limited to, mice, rats, rabbits, hamsters,
guinea pigs, pigs, micro-pigs, goats, sheep, cows and non-human primates, e.g., baboons,
monkeys, and chimpanzees may be used to generate transgenic animals. In a specific
embodiment, techniques described herein or otherwise known in the art are used to express
polypeptides of the invention in humans, as part of a gene therapy protocol.

Any technique known in the art may be used to introduce the transgene (i.e.,
polynucleotides of the invention) into animals to produce the founder lines of transgenic
animals. Such techniques include, but are not limited to, pronuclear microinjection
(Paterson et al., Appl. Microbiol. Biotechnol. 40:691-698 (1994); Carver et al.,
Biotechnology (NY) 11:1263-1270 (1993); Wright et al., Biotechnology (NY) 9:830-834
(1991); and Hoppe et al., U.S. Pat. No. 4,873,191 (1989)); retrovirus mediated gene transfer
into germ lines (Van der Putten et al., Proc. Natl. Acad. Sci. USA 82:6148-6152 (1985)),
blastocysts or embryos; gene targeting in embryonic stem cells (Thompson et al., Cell
56:313-321 (1989)); electroporation of cells or embryos (Lo, Mol. Cell. Biol. 3:1803-
1814 (1983)); introduction of the polynucleotides of the invention using a gene gun (see,
e.g., Ulmer et al., Science 259:1745 (1993)); introducing nucleic acid constructs into
embryonic pluripotent stem cells and transferring the stem cells back into the blastocyst;
and sperm-mediated gene transfer (Lavitrano et al., Cell 57:717-723 (1989)); etc. For a
review of such techniques, see Gordon, "Transgenic Animals," Intl. Rev. Cytol. 115:171-229
(1989), which is incorporated by reference herein in its entirety.

Any technique known in the art may be used to produce transgenic clones containing
polynucleotides of the invention, for example, nuclear transfer into enucleated oocytes of
nuclei from cultured embryonic, fetal, or adult cells induced to quiescence (Campbell et al.,
Nature 380:64-66 (1996); Wilmut et al., Nature 385:810-813 (1997)).

The present invention provides for transgenic animals that carry the transgene in all
their cells, as well as animals which carry the transgene in some, but not all their cells, i.e.,
mosaic or chimeric animals. The transgene may be integrated as a single transgene or as
multiple copies such as in concatamers, e.g., head-to-head tandems or head-to-tail tandems.
The transgene may also be selectively introduced into and activated in a particular cell type
by following, for example, the teaching of Lasko et al. (Lasko et al., Proc. Natl. Acad. Sci.
USA 89:6232-6236 (1992)). The regulatory sequences required for such a cell-type specific
activation will depend upon the particular cell type of interest, and will be apparent to those
of skill in the art. When it is desired that the polynucleotide transgene be integrated into the
chromosomal site of the endogenous gene, gene targeting is preferred.
Briefly, when such a technique is to be utilized, vectors containing some nucleotide
sequences homologous to the endogenous gene are designed for the purpose of integrating,
via homologous recombination with chromosomal sequences, into and disrupting the
function of the nucleotide sequence of the endogenous gene. The transgene may also be
selectively introduced into a particular cell type, thus inactivating the endogenous gene in
only that cell type, by following, for example, the teaching of Gu et al. (Gu et al., Science
265:103-106 (1994)). The regulatory sequences required for such a cell-type specific
inactivation will depend upon the particular cell type of interest, and will be apparent to
those of skill in the art. The contents of each of the documents recited in this paragraph are
herein incorporated by reference in their entirety.

Once transgenic animals have been generated, the expression of the recombinant
gene may be assayed utilizing standard techniques. Initial screening may be accomplished
by Southern blot analysis or PCR techniques to analyze animal tissues to verify that
integration of the transgene has taken place. The level of mRNA expression of the transgene
in the tissues of the transgenic animals may also be assessed using techniques which include,
but are not limited to, Northern blot analysis of tissue samples obtained from the animal, in
situ hybridization analysis, and reverse transcriptase-PCR (RT-PCR). Samples of transgenic
gene-expressing tissue may also be evaluated immunocytochemically or
immunohistochemically using antibodies specific for the transgene product.

Once the founder animals are produced, they may be bred, inbred, outbred, or
crossbred to produce colonies of the particular animal. Examples of such breeding strategies
include, but are not limited to: outbreeding of founder animals with more than one
integration site in order to establish separate lines; inbreeding of separate lines in order to
produce compound transgenics that express the transgene at higher levels because of the
effects of additive expression of each transgene; crossing of heterozygous transgenic animals
to produce animals homozygous for a given integration site in order to both augment
expression and eliminate the need for screening of animals by DNA analysis; crossing of
separate homozygous lines to produce compound heterozygous or homozygous lines; and
breeding to place the transgene on a distinct background that is appropriate for an
experimental model of interest.
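
The heterozygote cross mentioned among the breeding strategies can be illustrated with a single-locus Punnett square. This is a simplified sketch that assumes one integration site segregating in Mendelian fashion; the allele symbols ("T" for the transgene-bearing chromosome, "w" for wild type) and the `cross` function are illustrative choices.

```python
from collections import Counter
from itertools import product
from typing import Dict


def cross(parent_a: str, parent_b: str) -> Dict[str, float]:
    """Expected offspring genotype frequencies for a single-locus cross.

    Each parent is a two-character genotype such as "Tw", where "T" marks the
    transgene at one integration site and "w" the wild-type chromosome.
    """
    # Every gamete from one parent pairs with every gamete from the other.
    offspring = Counter("".join(sorted(pair)) for pair in product(parent_a, parent_b))
    total = sum(offspring.values())
    return {genotype: count / total for genotype, count in offspring.items()}
```

Under these assumptions, `cross("Tw", "Tw")` gives one quarter homozygous transgenic ("TT"), one half heterozygous ("Tw") and one quarter wild type ("ww"), while crossing two homozygotes yields only "TT" offspring, consistent with the statement that a homozygous line removes the need for screening by DNA analysis.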

Transgenic animals of the invention have uses which include, but are not limited to,
animal model systems useful in elaborating the biological function of METH1 and/or
METH2 polypeptides, studying conditions and/or disorders associated with aberrant METH1
and/or METH2 expression, and in screening for compounds effective in ameliorating such
conditions and/or disorders.



Example 33: METH1 and/or METH2 Knock-Out Animals



Endogenous METH1 and/or METH2 gene expression can also be reduced by
inactivating or "knocking out" the METH1 and/or METH2 gene and/or its promoter using
targeted homologous recombination. (E.g., see Smithies et al., Nature 317:230-234 (1985);
Thomas & Capecchi, Cell 51:503-512 (1987); Thompson et al., Cell 56:313-321 (1989); each
of which is incorporated by reference herein in its entirety). For example, a mutant, non-
functional polynucleotide of the invention (or a completely unrelated DNA sequence)
flanked by DNA homologous to the endogenous polynucleotide sequence (either the coding
regions or regulatory regions of the gene) can be used, with or without a selectable marker
and/or a negative selectable marker, to transfect cells that express polypeptides of the
invention in vivo. In another embodiment, techniques known in the art are used to generate
knockouts in cells that contain, but do not express, the gene of interest. Insertion of the DNA
construct, via targeted homologous recombination, results in inactivation of the targeted
gene. Such approaches are particularly suited in research and agricultural fields where
modifications to embryonic stem cells can be used to generate animal offspring with an
inactive targeted gene (e.g., see Thomas & Capecchi 1987 and Thompson 1989, supra).
However, this approach can be routinely adapted for use in humans provided the recombinant
DNA constructs are directly administered or targeted to the required site in vivo using
appropriate viral vectors that will be apparent to those of skill in the art.

In further embodiments of the invention, cells that are genetically engineered to
express the polypeptides of the invention, or alternatively, that are genetically engineered
not to express the polypeptides of the invention (e.g., knockouts) are administered to a
patient in vivo. Such cells may be obtained from the patient (i.e., animal, including human)
or an MHC compatible donor and can include, but are not limited to, fibroblasts, bone
marrow cells, blood cells (e.g., lymphocytes), adipocytes, muscle cells, endothelial cells, etc.
The cells are genetically engineered in vitro using recombinant DNA techniques to introduce
the coding sequence of polypeptides of the invention into the cells, or alternatively, to
disrupt the coding sequence and/or endogenous regulatory sequence associated with the
polypeptides of the invention, e.g., by transduction (using viral vectors, and preferably
vectors that integrate the transgene into the cell genome) or transfection procedures,
including, but not limited to, the use of plasmids, cosmids, YACs, naked DNA,
electroporation, liposomes, etc. The coding sequence of the polypeptides of the invention
can be placed under the control of a strong constitutive or inducible promoter or
promoter/enhancer to achieve expression, and preferably secretion, of the METH1
and/or METH2 polypeptides. The engineered cells which express and preferably secrete the
polypeptides of the invention can be introduced into the patient systemically, e.g., in the
circulation, or intraperitoneally.

Alternatively, the cells can be incorporated into a matrix and implanted in the body,
e.g., genetically engineered fibroblasts can be implanted as part of a skin graft; genetically
engineered endothelial cells can be implanted as part of a lymphatic or vascular graft. (See,
for example, Anderson et al., U.S. Patent No. 5,399,349; and Mulligan & Wilson, U.S.
Patent No. 5,460,959, each of which is incorporated by reference herein in its entirety).

When the cells to be administered are non-autologous or non-MHC compatible cells,
they can be administered using well known techniques which prevent the development of
a host immune response against the introduced cells. For example, the cells may be
introduced in an encapsulated form which, while allowing for an exchange of components
with the immediate extracellular environment, does not allow the introduced cells to be
recognized by the host immune system.

Knock-out animals of the invention have uses which include, but are not limited to,
animal model systems useful in elaborating the biological function of METH1 and/or
METH2 polypeptides, studying conditions and/or disorders associated with aberrant METH1
and/or METH2 expression, and in screening for compounds effective in ameliorating such
conditions and/or disorders.

Example 34: Assays Detecting Stimulation or Inhibition of B cell Proliferation and 
Differentiation 

Generation of functional humoral immune responses requires both soluble and
cognate signaling between B-lineage cells and their microenvironment. Signals may impart
a positive stimulus that allows a B-lineage cell to continue its programmed development, or
a negative stimulus that instructs the cell to arrest its current developmental pathway. To
date, numerous stimulatory and inhibitory signals have been found to influence B cell
responsiveness including IL-2, IL-4, IL-5, IL-6, IL-7, IL-10, IL-13, IL-14 and IL-15.
Interestingly, these signals are by themselves weak effectors but can, in combination with
various co-stimulatory proteins, induce activation, proliferation, differentiation, homing,
tolerance and death among B cell populations.

One of the best studied classes of B-cell co-stimulatory proteins is the
TNF-superfamily. Within this family CD40, CD27, and CD30 along with their respective ligands
CD154, CD70, and CD153 have been found to regulate a variety of immune responses.
Assays which allow for the detection and/or observation of the proliferation and
differentiation of these B-cell populations and their precursors are valuable tools in
determining the effects various proteins may have on these B-cell populations in terms of
proliferation and differentiation. Listed below are two assays designed to allow for the
detection of the differentiation, proliferation, or inhibition of B-cell populations and their
precursors.

In vitro Assay- Purified METH1 and/or METH2 protein, or truncated forms thereof,
is assessed for its ability to induce activation, proliferation, differentiation or inhibition
and/or death in B-cell populations and their precursors. The activity of METH1 and/or
METH2 protein on purified human tonsillar B cells, measured qualitatively over the dose
range from 0.1 to 10,000 ng/mL, is assessed in a standard B-lymphocyte co-stimulation
assay in which purified tonsillar B cells are cultured in the presence of either formalin-fixed
Staphylococcus aureus Cowan I (SAC) or immobilized anti-human IgM antibody as the
priming agent. Second signals such as IL-2 and IL-15 synergize with SAC and IgM
crosslinking to elicit B cell proliferation as measured by tritiated-thymidine incorporation.
Novel synergizing agents can be readily identified using this assay. The assay involves
isolating human tonsillar B cells by magnetic bead (MACS) depletion of CD3-positive cells.
The resulting cell population is greater than 95% B cells as assessed by expression of
CD45R(B220).

Various dilutions of each sample are placed into individual wells of a 96-well plate to which
are added 10^ B-cells suspended in culture medium (RPMI 1640 containing 10% FBS, 5 x
10^ M 2ME, 100 U/ml penicillin, 10 ug/ml streptomycin, and 10^ dilution of SAC) in a total
volume of 150 ul. Proliferation or inhibition is quantitated by a 20h pulse (1 uCi/well) with
3H-thymidine (6.7 Ci/mM) beginning 72h post factor addition. The positive and negative
controls are IL-2 and medium, respectively.
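
The dose range stated above (0.1 to 10,000 ng/mL) can be laid out as a log-spaced dilution series. The following is only a minimal sketch: the function name dose_series and the choice of one point per decade are illustrative assumptions, since the text does not specify the spacing of the dilutions.

```python
import math


def dose_series(low_ng_ml=0.1, high_ng_ml=10_000.0, points_per_decade=1):
    """Return log-spaced doses in ng/mL from low to high, inclusive.

    The 0.1 to 10,000 ng/mL range comes from the text; the spacing
    (points_per_decade) is an assumption made for illustration.
    """
    if low_ng_ml <= 0 or high_ng_ml <= low_ng_ml:
        raise ValueError("require 0 < low_ng_ml < high_ng_ml")
    if points_per_decade < 1:
        raise ValueError("points_per_decade must be at least 1")
    decades = math.log10(high_ng_ml / low_ng_ml)
    steps = round(decades * points_per_decade)
    return [low_ng_ml * 10 ** (i / points_per_decade) for i in range(steps + 1)]


if __name__ == "__main__":
    for dose in dose_series():
        print(f"{dose:g} ng/mL")
```

With the defaults this yields six ten-fold steps spanning the stated range; a finer series can be requested by raising points_per_decade.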

In vivo Assay- BALB/c mice are injected (i.p.) twice per day with buffer only, or
2 mg/Kg of METH1 and/or METH2 protein, or truncated forms thereof. Mice receive this
treatment for 4 consecutive days, at which time they are sacrificed and various tissues and
serum collected for analyses. Comparison of H&E sections from normal and METH1 and/or
METH2 protein-treated spleens identifies the results of the activity of METH1 and/or METH2
protein on spleen cells, such as the diffusion of peri-arterial lymphatic sheaths, and/or
significant increases in the nucleated cellularity of the red pulp regions, which may indicate
the activation of the differentiation and proliferation of B-cell populations.

Immunohistochemical studies using a B cell marker, anti-CD45R(B220), are used to
determine whether any physiological changes to splenic cells, such as splenic
disorganization, are due to increased B-cell representation within loosely defined B-cell
zones that infiltrate established T-cell regions.

Flow cytometric analyses of the spleens from METH1 and/or METH2 protein-treated
mice are used to indicate whether METH1 and/or METH2 protein specifically increases the
proportion of ThB+, CD45R(B220)dull B cells over that which is observed in control mice.
Likewise, a predicted consequence of increased mature B-cell representation in vivo
is a relative increase in serum Ig titers. Accordingly, serum IgM and IgA levels are
compared between buffer and METH1 and/or METH2 protein-treated mice.

The studies described in this example test activity in METH1 and/or METH2
protein. However, one skilled in the art could easily modify the exemplified studies to test
the activity of METH1 and/or METH2 polynucleotides (e.g., gene therapy), agonists, and/or
antagonists of METH1 and/or METH2.

Example 35: T Cell Proliferation Assay 

A CD3-induced proliferation assay is performed on PBMCs and is measured by the
uptake of 3H-thymidine. The assay is performed as follows. Ninety-six well plates are
coated with 100 ul/well of mAb to CD3 (HIT3a, Pharmingen) or isotype-matched control
mAb (B33.1) overnight at 4°C (1 mg/ml in 0.05 M bicarbonate buffer, pH 9.5), then washed
three times with PBS. PBMC are isolated by F/H gradient centrifugation from human
peripheral blood and added to quadruplicate wells (5 x 10^4/well) of mAb coated plates in
RPMI containing 10% FCS and P/S in the presence of varying concentrations of
METH1 and/or METH2 protein (total volume 200 ul). Relevant protein buffer and
medium alone are controls. After 48 hr. culture at 37°C, plates are spun for 2 min. at 1000
rpm and 100 ul of supernatant is removed and stored at -20°C for measurement of IL-2 (or
other cytokines) if effect on proliferation is observed. Wells are supplemented with 100 ul
of medium containing 0.5 uCi of 3H-thymidine and cultured at 37°C for 18-24 hr. Wells are
harvested and incorporation of 3H-thymidine used as a measure of proliferation. Anti-CD3
alone is the positive control for proliferation. IL-2 (100 U/ml) is also used as a control
which enhances proliferation. Control antibody which does not induce proliferation of T
cells is used as the negative control for the effects of METH1 and/or METH2 proteins.

The studies described in this example tested activity in METH1 and/or METH2
protein. However, one skilled in the art could easily modify the exemplified studies to test
the activity of METH1 and/or METH2 polynucleotides (e.g., gene therapy), agonists, and/or
antagonists of METH1 and/or METH2.

Example 36: Effect of METH1 and/or METH2 on the Expression of MHC Class II,
Costimulatory and Adhesion Molecules and Cell Differentiation of Monocytes and
Monocyte-Derived Human Dendritic Cells

Dendritic cells are generated by the expansion of proliferating precursors found in
the peripheral blood: adherent PBMC or elutriated monocytic fractions are cultured for 7-10
days with GM-CSF (50 ng/ml) and IL-4 (20 ng/ml). These dendritic cells have the
characteristic phenotype of immature cells (expression of CD1, CD80, CD86, CD40 and
MHC class II antigens). Treatment with activating factors, such as TNF-alpha, causes a rapid
change in surface phenotype (increased expression of MHC class I and II, costimulatory and
adhesion molecules, downregulation of Fc-gamma RII, upregulation of CD83). These changes
correlate with increased antigen-presenting capacity and with functional maturation of the
dendritic cells.

FACS analysis of surface antigens is performed as follows. Cells are treated 1-3
days with increasing concentrations of METH1 and/or METH2 or LPS (positive control),
washed with PBS containing 1% BSA and 0.02 mM sodium azide, and then incubated with
1:20 dilution of appropriate FITC- or PE-labeled monoclonal antibodies for 30 minutes at
4°C. After an additional wash, the labeled cells are analyzed by flow cytometry on a
FACScan (Becton Dickinson).

Effect on the production of cytokines. Cytokines generated by dendritic cells, in
particular IL-12, are important in the initiation of T-cell dependent immune responses.
IL-12 strongly influences the development of Th1 helper T-cell immune response, and induces
cytotoxic T and NK cell function. An ELISA is used to measure the IL-12 release as
follows. Dendritic cells (10^/ml) are treated with increasing concentrations of METH1
and/or METH2 for 24 hours. LPS (100 ng/ml) is added to the cell culture as positive
control. Supernatants from the cell cultures are then collected and analyzed for IL-12
content using a commercial ELISA kit (e.g., R & D Systems (Minneapolis, MN)). The
standard protocols provided with the kits are used.

Effect on the expression of MHC Class II, costimulatory and adhesion
molecules. Three major families of cell surface antigens can be identified on monocytes:
adhesion molecules, molecules involved in antigen presentation, and Fc receptor.
Modulation of the expression of MHC class II antigens and other costimulatory molecules,
such as B7 and ICAM-1, may result in changes in the antigen presenting capacity of
monocytes and ability to induce T cell activation. Increased expression of Fc receptors may
correlate with improved monocyte cytotoxic activity, cytokine release and phagocytosis.

FACS analysis is used to examine the surface antigens as follows. Monocytes are
treated 1-5 days with increasing concentrations of METH1 and/or METH2 or LPS (positive
control), washed with PBS containing 1% BSA and 0.02 mM sodium azide, and then
incubated with 1:20 dilution of appropriate FITC- or PE-labeled monoclonal antibodies for
30 minutes at 4°C. After an additional wash, the labeled cells are analyzed by flow
cytometry on a FACScan (Becton Dickinson).

Monocyte activation and/or increased survival. Assays for molecules that activate
(or alternatively, inactivate) monocytes and/or increase monocyte survival (or alternatively,
decrease monocyte survival) are known in the art and may routinely be applied to determine
whether a molecule of the invention functions as an inhibitor or activator of monocytes.
METH1 and/or METH2, agonists, or antagonists of METH1 and/or METH2 can be screened
using the three assays described below. For each of these assays, peripheral blood
mononuclear cells (PBMC) are purified from single donor leukopacks (American Red Cross,
Baltimore, MD) by centrifugation through a Histopaque gradient (Sigma). Monocytes are
isolated from PBMC by counterflow centrifugal elutriation.

1. Monocyte Survival Assay. Human peripheral blood monocytes progressively lose
viability when cultured in absence of serum or other stimuli. Their death results from
an internally regulated process (apoptosis). Addition to the culture of activating factors, such
as TNF-alpha, dramatically improves cell survival and prevents DNA fragmentation.

Propidium iodide (PI) staining is used to measure apoptosis as follows. Monocytes
are cultured for 48 hours in polypropylene tubes in serum-free medium (positive control),
in the presence of 100 ng/ml TNF-alpha (negative control), and in the presence of varying
concentrations of the compound to be tested. Cells are suspended at a concentration of 2 x
10^/ml in PBS containing PI at a final concentration of 5 ug/ml, and then incubated at room
temperature for 5 minutes before FACScan analysis. PI uptake has been demonstrated to
correlate with DNA fragmentation in this experimental paradigm.

2. Effect on cytokine release. An important function of monocytes/macrophages is
their regulatory activity on other cellular populations of the immune system through the
release of cytokines after stimulation. An ELISA to measure cytokine release is performed
as follows. Human monocytes are incubated at a density of 5x10^ cells/ml with increasing
concentrations of METH1 and/or METH2 and under the same conditions, but in the absence
of METH1 and/or METH2. For IL-12 production, the cells are primed overnight with IFN
(100 U/ml) in presence of METH1 and/or METH2. LPS (10 ng/ml) is then added.
Conditioned media are collected after 24h and kept frozen until use. Measurement of
TNF-alpha, IL-10, MCP-1 and IL-8 is then performed using a commercially available ELISA kit
(e.g., R&D Systems (Minneapolis, MN)) and applying the standard protocols provided with
the kit.

3. Oxidative burst. Purified monocytes are plated in 96-well plate at 2-1x10^ cell/well.
Increasing concentrations of METH1 and/or METH2 are added to the wells in a total volume
of 0.2 ml culture medium (RPMI 1640 + 10% FCS, glutamine and antibiotics). After 3 days
incubation, the plates are centrifuged and the medium is removed from the wells. To the
macrophage monolayers, 0.2 ml per well of phenol red solution (140 mM NaCl, 10 mM
potassium phosphate buffer pH 7.0, 5.5 mM dextrose, 0.56 mM phenol red and 19 U/ml of
HRPO) is added, together with the stimulant (200 nM PMA). The plates are incubated at
37°C for 2 hours and the reaction is stopped by adding 20 ul 1N NaOH per well. The
absorbance is read at 610 nm. To calculate the amount of H2O2 produced by the
macrophages, a standard curve of a H2O2 solution of known molarity is performed for each
experiment.
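
The standard-curve step above can be sketched as a straight-line fit of absorbance at 610 nm against known H2O2 amounts, followed by inversion of the fit for unknown wells. This is a minimal sketch under stated assumptions: a linear response over the range of the standards is assumed, the function names fit_line and h2o2_from_absorbance are hypothetical, and the numbers in the usage section are invented placeholders, not measured data.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of ys = slope * xs + intercept."""
    n = len(xs)
    if n < 2 or n != len(ys):
        raise ValueError("need at least two paired standards")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("standards must span more than one concentration")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def h2o2_from_absorbance(a610, slope, intercept):
    """Invert the standard curve to estimate H2O2 from an A610 reading."""
    if slope == 0:
        raise ValueError("standard curve has zero slope")
    return (a610 - intercept) / slope


if __name__ == "__main__":
    # Placeholder standards (nmol H2O2 per well) and readings, for illustration only.
    standards_nmol = [0, 10, 20, 40]
    standards_a610 = [0.05, 0.25, 0.45, 0.85]
    slope, intercept = fit_line(standards_nmol, standards_a610)
    print(h2o2_from_absorbance(0.45, slope, intercept))
```

Fitting a fresh curve per experiment, as the text requires, keeps plate-to-plate variation in the phenol red reagent from biasing the estimate.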

The studies described in this example tested activity in METH1 and/or METH2
protein. However, one skilled in the art could easily modify the exemplified studies to test
the activity of METH1 and/or METH2 polynucleotides (e.g., gene therapy), agonists, and/or
antagonists of METH1 and/or METH2.

Example 37: METH1 and/or METH2 Biological Effects

Astrocyte and Neuronal Assays. Recombinant METH1 and/or METH2, expressed
in Escherichia coli and purified as described above, can be tested for activity in promoting
the survival, neurite outgrowth, or phenotypic differentiation of cortical neuronal cells and
for inducing the proliferation of glial fibrillary acidic protein immunopositive cells,
astrocytes. The selection of cortical cells for the bioassay is based on the prevalent
expression of FGF-1 and FGF-2 in cortical structures and on the previously reported
enhancement of cortical neuronal survival resulting from FGF-2 treatment. A thymidine
incorporation assay, for example, can be used to elucidate METH1 and/or METH2's activity
on these cells.

Moreover, previous reports describing the biological effects of FGF-2 (basic FGF)
on cortical or hippocampal neurons in vitro have demonstrated increases in both neuron
survival and neurite outgrowth (Walicke, P. et al., "Fibroblast growth factor promotes
survival of dissociated hippocampal neurons and enhances neurite extension." Proc. Natl.
Acad. Sci. USA 83:3012-3016 (1986), assay herein incorporated by reference in its entirety).
However, reports from experiments done on PC-12 cells suggest that these two responses
are not necessarily synonymous and may depend on not only which FGF is being tested but
also on which receptor(s) are expressed on the target cells. Using the primary cortical
neuronal culture paradigm, the ability of METH1 and/or METH2 to induce neurite
outgrowth can be compared to the response achieved with FGF-2 using, for example, a
thymidine incorporation assay.

Fibroblast and endothelial cell assays. Human lung fibroblasts are obtained from
Clonetics (San Diego, CA) and maintained in growth media from Clonetics. Dermal
microvascular endothelial cells are obtained from Cell Applications (San Diego, CA). For
proliferation assays, the human lung fibroblasts and dermal microvascular endothelial cells
can be cultured at 5,000 cells/well in a 96-well plate for one day in growth medium. The
cells are then incubated for one day in 0.1% BSA basal medium. After replacing the medium
with fresh 0.1% BSA medium, the cells are incubated with the test proteins for 3 days.
Alamar Blue (Alamar Biosciences, Sacramento, CA) is added to each well to a final
concentration of 10%. The cells are incubated for 4 hr. Cell viability is measured by reading
in a CytoFluor fluorescence reader. For the PGE2 assays, the human lung fibroblasts are
cultured at 5,000 cells/well in a 96-well plate for one day. After a medium change to 0.1%
BSA basal medium, the cells are incubated with FGF-2 or METH1 and/or METH2 with or
without IL-1 alpha for 24 hours. The supernatants are collected and assayed for PGE2 by EIA kit
(Cayman, Ann Arbor, MI). For the IL-6 assays, the human lung fibroblasts are cultured at
5,000 cells/well in a 96-well plate for one day. After a medium change to 0.1% BSA basal
medium, the cells are incubated with FGF-2 or METH1 and/or METH2 with or without
IL-1 alpha for 24 hours. The supernatants are collected and assayed for IL-6 by ELISA kit
(Endogen, Cambridge, MA).

Human lung fibroblasts are cultured with FGF-2 or METH1 and/or METH2 for 3
days in basal medium before the addition of Alamar Blue to assess effects on growth of the
fibroblasts. FGF-2 should show a stimulation at 10 - 2500 ng/ml which can be used to
compare stimulation with METH1 and/or METH2.

Parkinson Models. The loss of motor function in Parkinson's disease is attributed
to a deficiency of striatal dopamine resulting from the degeneration of the nigrostriatal
dopaminergic projection neurons. An animal model for Parkinson's that has been
extensively characterized involves the systemic administration of
1-methyl-4-phenyl-1,2,3,6-tetrahydropyridine (MPTP). In the CNS, MPTP is taken up by
astrocytes and catabolized by monoamine oxidase B to 1-methyl-4-phenylpyridinium (MPP+)
and released. Subsequently, MPP+ is actively accumulated in dopaminergic neurons by the
high-affinity reuptake transporter for dopamine. MPP+ is then concentrated in mitochondria
by the electrochemical gradient and selectively inhibits nicotinamide adenine
dinucleotide:ubiquinone oxidoreductase (complex I), thereby interfering with electron
transport and eventually generating oxygen radicals.

It has been demonstrated in tissue culture paradigms that FGF-2 (basic FGF) has
trophic activity towards nigral dopaminergic neurons (Ferrari et al., Dev. Biol. 1989).
Recently, Dr. Unsicker's group has demonstrated that administering FGF-2 in gel foam
implants in the striatum results in the near complete protection of nigral dopaminergic
neurons from the toxicity associated with MPTP exposure (Otto and Unsicker, J.
Neuroscience, 1990).

Based on the data with FGF-2, METH1 and/or METH2 can be evaluated to
determine whether it has an action similar to that of FGF-2 in enhancing dopaminergic
neuronal survival in vitro and it can also be tested in vivo for protection of dopaminergic
neurons in the striatum from the damage associated with MPTP treatment. The potential
effect of METH1 and/or METH2 is first examined in vitro in a dopaminergic neuronal cell
culture paradigm. The cultures are prepared by dissecting the midbrain floor plate from
gestation day 14 Wistar rat embryos. The tissue is dissociated with trypsin and seeded at a
density of 200,000 cells/cm^2 on polyornithine-laminin coated glass coverslips. The cells are
maintained in Dulbecco's Modified Eagle's medium and F12 medium containing hormonal
supplements (N1). The cultures are fixed with paraformaldehyde after 8 days in vitro and
are processed for tyrosine hydroxylase, a specific marker for dopaminergic neurons,
immunohistochemical staining. Dissociated cell cultures are prepared from embryonic rats.
The culture medium is changed every third day and the factors are also added at that time.

Since the dopaminergic neurons are isolated from animals at gestation day 14, a
developmental time which is past the stage when the dopaminergic precursor cells are
proliferating, an increase in the number of tyrosine hydroxylase immunopositive neurons
would represent an increase in the number of dopaminergic neurons surviving in vitro.
Therefore, if METH1 and/or METH2 acts to prolong the survival of dopaminergic neurons,
it would suggest that METH1 and/or METH2 may be involved in Parkinson's Disease.

The studies described in this example tested activity in METH1 and/or METH2
protein. However, one skilled in the art could easily modify the exemplified studies to test
the activity of METH1 and/or METH2 polynucleotides (e.g., gene therapy), agonists, and/or
antagonists of METH1 and/or METH2.






Example 38: The Effect of METH1 and/or METH2 on the Growth of Vascular
Endothelial Cells

On day 1, human umbilical vein endothelial cells (HUVEC) are seeded at 2-5x10^
cells/35 mm dish density in M199 medium containing 4% fetal bovine serum (FBS), 16
units/ml heparin, and 50 units/ml endothelial cell growth supplements (ECGS, Biotechnique,
Inc.). On day 2, the medium is replaced with M199 containing 10% FBS, 8 units/ml
heparin. METH1 and/or METH2 protein of SEQ ID NO. 2, and positive controls, such as
VEGF and basic FGF (bFGF), are added at varying concentrations. On days 4 and 6, the
medium is replaced. On day 8, cell number is determined with a Coulter Counter.

An increase in the number of HUVEC cells indicates that METH1 and/or METH2
may stimulate proliferation of vascular endothelial cells.

The studies described in this example tested activity in METH1 and/or METH2
protein. However, one skilled in the art could easily modify the exemplified studies to test
the activity of METH1 and/or METH2 polynucleotides (e.g., gene therapy), agonists, and/or
antagonists of METH1 and/or METH2.

Example 39: Stimulatory Effect of METH1 and/or METH2 on the Proliferation of
Vascular Endothelial Cells

For evaluation of mitogenic activity of growth factors, the colorimetric MTS (3-(4,5-
dimethylthiazol-2-yl)-5-(3-carboxymethoxyphenyl)-2-(4-sulfophenyl)2H-tetrazolium) assay
with the electron coupling reagent PMS (phenazine methosulfate) is performed (CellTiter
96 AQ, Promega). Cells are seeded in a 96-well plate (5,000 cells/well) in 0.1 mL
serum-supplemented medium and are allowed to attach overnight. After serum-starvation for 12
hours in 0.5% FBS, conditions (bFGF, VEGF165 or METH1 and/or METH2 in 0.5% FBS)
with or without Heparin (8 U/ml) are added to wells for 48 hours. 20 mg of MTS/PMS
mixture (1:0.05) are added per well and allowed to incubate for 1 hour at 37°C before
measuring the absorbance at 490 nm in an ELISA plate reader. Background absorbance
from control wells (same media, no cells) is subtracted, and seven wells are performed in
parallel for each condition. See, Leak et al., In Vitro Cell. Dev. Biol. 30A:512-518 (1994).
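
The background subtraction described above can be sketched as follows. This is an illustrative sketch only: the function name corrected_absorbance is hypothetical, reporting the mean and sample standard deviation of the replicate wells is an assumption (the text states only that background is subtracted and that seven wells are run per condition), and the readings shown are placeholders rather than data.

```python
from statistics import mean, stdev


def corrected_absorbance(sample_wells, blank_wells):
    """Subtract mean blank A490 from each replicate; return (mean, stdev).

    sample_wells: A490 readings for one condition (seven wells in the text).
    blank_wells:  A490 readings from control wells with media but no cells.
    """
    if len(sample_wells) < 2 or not blank_wells:
        raise ValueError("need at least two sample wells and one blank well")
    background = mean(blank_wells)
    corrected = [a - background for a in sample_wells]
    return mean(corrected), stdev(corrected)


if __name__ == "__main__":
    # Placeholder readings, for illustration only.
    wells = [0.60, 0.62, 0.58, 0.61, 0.59, 0.60, 0.60]
    blanks = [0.10, 0.10, 0.10]
    print(corrected_absorbance(wells, blanks))
```
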
The studies described in this example tested activity in METH1 and/or METH2
protein. However, one skilled in the art could easily modify the exemplified studies to test
the activity of METH1 and/or METH2 polynucleotides (e.g., gene therapy), agonists, and/or
antagonists of METH1 and/or METH2.

Example 40: Inhibition of PDGF-induced Vascular Smooth Muscle Cell Proliferation
Stimulatory Effect 

HAoSMC proliferation can be measured, for example, by BrdUrd incorporation.
Briefly, subconfluent, quiescent cells grown on the 4-chamber slides are transfected with
CRP or FITC-labeled AT2-3LP. Then, the cells are pulsed with 10% calf serum and 6
mg/ml BrdUrd. After 24 h, immunocytochemistry is performed by using BrdUrd Staining
Kit (Zymed Laboratories). In brief, the cells are incubated with the biotinylated mouse
anti-BrdUrd antibody at 4°C for 2 h after being exposed to denaturing solution and then
incubated with the streptavidin-peroxidase and diaminobenzidine. After counterstaining
with hematoxylin, the cells are mounted for microscopic examination, and the
BrdUrd-positive cells are counted. The BrdUrd index is calculated as a percent of the
BrdUrd-positive cells to the total cell number. In addition, the simultaneous detection of the BrdUrd
staining (nucleus) and the FITC uptake (cytoplasm) is performed for individual cells by the
concomitant use of bright field illumination and dark field-UV fluorescent illumination.
See, Hayashida et al., J. Biol. Chem. 271(36):21985-21992 (1996).
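
The BrdUrd index defined above is a simple percentage, which can be written out as a short sketch. The function name brdurd_index and the input checks are illustrative assumptions; only the definition (BrdUrd-positive cells as a percent of total cells) comes from the text.

```python
def brdurd_index(positive_cells, total_cells):
    """Return BrdUrd-positive cells as a percentage of all counted cells."""
    if total_cells <= 0:
        raise ValueError("total_cells must be positive")
    if not 0 <= positive_cells <= total_cells:
        raise ValueError("positive_cells must lie between 0 and total_cells")
    return 100.0 * positive_cells / total_cells


if __name__ == "__main__":
    # Example counts are placeholders, not experimental results.
    print(brdurd_index(45, 300))
```
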

The studies described in this example tested activity in METH1 and/or METH2 protein.
However, one skilled in the art could easily modify the exemplified studies to test the
activity of METH1 and/or METH2 polynucleotides (e.g., gene therapy), agonists, and/or
antagonists of METH1 and/or METH2.

Example 41: Stimulation of Endothelial Migration 

This example will be used to explore the possibility that METH1 and/or METH2
may stimulate lymphatic endothelial cell migration.

Endothelial cell migration assays are performed using a 48 well microchemotaxis
chamber (Neuroprobe Inc., Cabin John, MD; Falk, W., et al., J. Immunological Methods
1980;33:239-247). Polyvinylpyrrolidone-free polycarbonate filters with a pore size of 8 um
(Nucleopore Corp., Cambridge, MA) are coated with 0.1% gelatin for at least 6 hours at
room temperature and dried under sterile air. Test substances are diluted to appropriate
concentrations in M199 supplemented with 0.25% bovine serum albumin (BSA), and 25 ul
of the final dilution is placed in the lower chamber of the modified Boyden apparatus.
Subconfluent, early passage (2-6) HUVEC or BMEC cultures are washed and trypsinized
for the minimum time required to achieve cell detachment. After placing the filter between
lower and upper chamber, 2.5 x 10^ cells suspended in 50 ul M199 containing 1% FBS are
seeded in the upper compartment. The apparatus is then incubated for 5 hours at 37°C in
a humidified chamber with 5% CO2 to allow cell migration. After the incubation period,
the filter is removed and the upper side of the filter with the non-migrated cells is scraped
with a rubber policeman. The filters are fixed with methanol and stained with a Giemsa
solution (Diff-Quick, Baxter, McGraw Park, IL). Migration is quantified by counting cells
in three random high-power fields (40x) in each well, and all groups are performed in
quadruplicate.
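
The quantification step above (three random high-power fields per well, four wells per group) can be summarized with a short sketch. The function name migration_score is hypothetical, and averaging fields within a well before averaging wells within a group is an assumed convention; the text specifies only the number of fields and replicates.

```python
from statistics import mean


def migration_score(field_counts_per_well):
    """Summarize migrated-cell counts for one treatment group.

    field_counts_per_well: one inner list per well, holding the cell count
    from each high-power field (three fields per well in the text).
    Returns (group_mean, per_well_means).
    """
    if not field_counts_per_well or any(not f for f in field_counts_per_well):
        raise ValueError("every well needs at least one field count")
    per_well = [mean(fields) for fields in field_counts_per_well]
    return mean(per_well), per_well


if __name__ == "__main__":
    # Placeholder counts for a quadruplicate group, for illustration only.
    group = [[10, 12, 14], [20, 22, 24], [30, 30, 30], [18, 20, 22]]
    print(migration_score(group))
```

Treating the well, rather than the individual field, as the replicate unit avoids overstating the sample size when groups are later compared.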

The studies described in this example tested activity in METH1 and/or METH2
protein. However, one skilled in the art could easily modify the exemplified studies to test
the activity of METH1 and/or METH2 polynucleotides (e.g., gene therapy), agonists, and/or
antagonists of METH1 and/or METH2.

Example 42: Stimulation of Nitric Oxide Production by Endothelial Cells

Nitric oxide released by the vascular endothelium is believed to be a mediator of
vascular endothelium relaxation. Thus, METH1 and/or METH2 activity can be assayed by
determining nitric oxide production by endothelial cells in response to METH1 and/or
METH2.

Nitric oxide is measured in 96-well plates of confluent microvascular endothelial 
cells after 24 hours starvation and a subsequent 4 hr exposure to various levels of a positive 
control (such as VEGF-1) and METHl and/or METH2, Nitric oxide in the medium is 
25 determined by use of the Griess reagent to measure total nitrite after reduction of nitric 

oxide-derived nitrate by nitrate reductase. The effect of METHl and/or METH2 on nitric 
oxide release is examined on HUVEC. 

Briefly, NO release from cultured HUVEC monolayer is measured with a NO-specific
polarographic electrode connected to a NO meter (Iso-NO, World Precision
Instruments Inc.) (1049). Calibration of the NO elements is performed according to the
following equation:

2 KNO2 + 2 KI + 2 H2SO4 -> 2 NO + I2 + 2 H2O + 2 K2SO4

The standard calibration curve is obtained by adding graded concentrations of KNO2 (0, 5,
10, 25, 50, 100, 250, and 500 nmol/L) into the calibration solution containing KI and H2SO4.
The specificity of the Iso-NO electrode to NO is previously determined by measurement of
NO from authentic NO gas (1050). The culture medium is removed and HUVECs are
washed twice with Dulbecco's phosphate buffered saline. The cells are then bathed in 5 ml
of filtered Krebs-Henseleit solution in 6-well plates, and the cell plates are kept on a slide
warmer (Lab Line Instruments Inc.) to maintain the temperature at 37°C. The NO sensor
probe is inserted vertically into the wells, keeping the tip of the electrode 2 mm under the
surface of the solution, before addition of the different conditions. S-nitroso acetyl
penicillamin (SNAP) is used as a positive control. The amount of released NO is expressed
as picomoles per 1x10^ endothelial cells. All values reported are means of four to six
measurements in each group (number of cell culture wells). See, Leak et al. Biochem. and
Biophys. Res. Comm. 277:96-105 (1995).
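
In the calibration reaction each mole of KNO2 yields one mole of NO, so the graded KNO2 concentrations can be read directly as NO concentrations. A minimal sketch of building and using such a standard curve follows, assuming a linear electrode response; the function names and the electrode currents are hypothetical and are not taken from the studies described.

```python
def fit_line(x, y):
    """Ordinary least-squares fit y = slope * x + intercept."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, my - slope * mx

# KNO2 standards from the text (nmol/L); 2 KNO2 -> 2 NO, so NO (nmol/L) is the same.
no_nmol_per_l = [0, 5, 10, 25, 50, 100, 250, 500]
# Hypothetical electrode currents (pA), assumed here to be perfectly linear.
current_pa = [2.0 + 0.5 * c for c in no_nmol_per_l]

slope, intercept = fit_line(no_nmol_per_l, current_pa)

def current_to_no(reading_pa):
    """Convert a sample current (pA) to an NO concentration (nmol/L)."""
    return (reading_pa - intercept) / slope

sample_no = current_to_no(27.0)  # hypothetical sample reading
```

With real readings the fit would not be exact, and the residuals of the standards give a quick check on whether the linear assumption holds over the 0 to 500 nmol/L range.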

The studies described in this example tested activity in METH1 and/or METH2
protein. However, one skilled in the art could easily modify the exemplified studies to test
the activity of METH1 and/or METH2 polynucleotides (e.g., gene therapy), agonists, and/or
antagonists of METH1 and/or METH2.

Example 43: Effect of METH1 and/or METH2 on Cord Formation in Angiogenesis

Another step in angiogenesis is cord formation, marked by differentiation of 
endothelial cells. This bioassay measures the ability of microvascular endothelial cells to 
form capillary-like structures (hollow structures) when cultured in vitro. 

CADMEC (microvascular endothelial cells) are purchased from Cell Applications,
Inc. as proliferating (passage 2) cells and are cultured in Cell Applications' CADMEC
Growth Medium and used at passage 5. For the in vitro angiogenesis assay, the wells of a
48-well cell culture plate are coated with Cell Applications' Attachment Factor Medium (200
ml/well) for 30 min. at 37°C. CADMEC are seeded onto the coated wells at 7,500 cells/well
and cultured overnight in Growth Medium. The Growth Medium is then replaced with 300
mg Cell Applications' Chord Formation Medium containing control buffer or METH1
and/or METH2 (0.1 to 100 ng/ml) and the cells are cultured for an additional 48 hr. The
numbers and lengths of the capillary-like chords are quantitated through use of the Boeckeler
VIA-170 video image analyzer. All assays are done in triplicate.

Commercial (R&D) VEGF (50 ng/ml) is used as a positive control. beta-estradiol (1
ng/ml) is used as a negative control. The appropriate buffer (without protein) is also utilized
as a control.

The studies described in this example tested activity in METH1 and/or METH2
protein. However, one skilled in the art could easily modify the exemplified studies to test
the activity of METH1 and/or METH2 polynucleotides (e.g., gene therapy), agonists, and/or
antagonists of METH1 and/or METH2.

Example 44: Rescue of Ischemia in Rabbit Lower Limb Model

To study the in vivo effects of METH1 and/or METH2 on ischemia, a rabbit
hindlimb ischemia model is created by surgical removal of one femoral artery as described
previously (Takeshita, S. et al., Am. J. Pathol. 147:1649-1660 (1995)). The excision of the
femoral artery results in retrograde propagation of thrombus and occlusion of the external
iliac artery. Consequently, blood flow to the ischemic limb is dependent upon collateral
vessels originating from the internal iliac artery (Takeshita, S. et al., Am. J. Pathol.
147:1649-1660 (1995)). An interval of 10 days is allowed for post-operative recovery of rabbits and
development of endogenous collateral vessels. At 10 days post-operatively (day 0), after
performing a baseline angiogram, the internal iliac artery of the ischemic limb is transfected
with 500 mg naked METH1 and/or METH2 expression plasmid by arterial gene transfer
technology using a hydrogel-coated balloon catheter as described (Riessen, R. et al., Hum.
Gene Ther. 4:749-758 (1993); Leclerc, G. et al., J. Clin. Invest. 90:936-944 (1992)). When
METH1 and/or METH2 is used in the treatment, a single bolus of 500 mg METH1 and/or
METH2 protein or control is delivered into the internal iliac artery of the ischemic limb over
a period of 1 min. through an infusion catheter. On day 30, various parameters are measured
in these rabbits: (a) BP ratio - The blood pressure ratio of systolic pressure of the ischemic
limb to that of normal limb; (b) Blood Flow and Flow Reserve - Resting FL: the blood flow
during undilated condition and Max FL: the blood flow during fully dilated condition (also
an indirect measure of the blood vessel amount) and Flow Reserve is reflected by the ratio
of max FL: resting FL; (c) Angiographic Score - This is measured by the angiogram of
collateral vessels. A score is determined by the percentage of circles in an overlaying grid
that are crossed by opacified arteries, relative to the total number of circles in the rabbit
thigh; (d) Capillary density - The number of collateral capillaries determined in light
microscopic sections taken from hindlimbs.
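
The ratio-based parameters (a) through (c) can be sketched as simple functions. This is an illustration only: the function names, units, and input values are hypothetical, and the angiographic score is read here as the fraction of grid circles crossed by opacified arteries.

```python
def bp_ratio(systolic_ischemic, systolic_normal):
    """(a) Systolic pressure of the ischemic limb relative to the normal limb."""
    return systolic_ischemic / systolic_normal

def flow_reserve(max_flow, resting_flow):
    """(b) Ratio of fully dilated (max FL) to undilated (resting FL) blood flow."""
    return max_flow / resting_flow

def angiographic_score(circles_crossed, circles_total):
    """(c) Fraction of grid circles crossed by opacified arteries."""
    return circles_crossed / circles_total

# Hypothetical day-30 measurements for a single animal.
example_bp = bp_ratio(60.0, 80.0)
example_reserve = flow_reserve(30.0, 12.0)
example_score = angiographic_score(18, 60)
```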

The studies described in this example tested activity in METH1 and/or METH2
protein. However, one skilled in the art could easily modify the exemplified studies to test
the activity of METH1 and/or METH2 polynucleotides (e.g., gene therapy), agonists, and/or
antagonists of METH1 and/or METH2.

Example 45: Effect of METH1 and/or METH2 on Vasodilation



Since dilation of vascular endothelium is important in reducing blood pressure, the
ability of METH1 and/or METH2 to affect the blood pressure in spontaneously hypertensive
rats (SHR) is examined. Increasing doses (0, 10, 30, 100, 300, and 900 mg/kg) of the
METH1 and/or METH2 are administered to 13-14 week old spontaneously hypertensive rats
(SHR). Data are expressed as the mean +/- SEM. Statistical analysis is performed with
a paired t-test and statistical significance is defined as p<0.05 vs. the response to buffer
alone.
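
A minimal sketch of the paired comparison against buffer alone is given below, using only the Python standard library. The blood pressure values are hypothetical, and significance is judged here by comparing the t statistic with the two-sided 5% critical value for 4 degrees of freedom (2.776) rather than by computing an exact p value.

```python
from math import sqrt
from statistics import mean, stdev

def paired_t(treated, control):
    """Paired t statistic for matched measurements on the same animals.

    Returns (t, degrees of freedom, mean difference, SEM of the differences).
    """
    diffs = [t - c for t, c in zip(treated, control)]
    n = len(diffs)
    sem = stdev(diffs) / sqrt(n)
    return mean(diffs) / sem, n - 1, mean(diffs), sem

# Hypothetical systolic pressures (mmHg) in five rats: buffer alone vs. one dose.
buffer_bp = [180, 175, 190, 185, 170]
dose_bp = [170, 168, 178, 176, 163]

t_stat, df, mean_diff, sem_diff = paired_t(dose_bp, buffer_bp)
significant = abs(t_stat) > 2.776  # two-sided critical value, p < 0.05, df = 4
```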

The studies described in this example tested activity in METH1 and/or METH2
protein. However, one skilled in the art could easily modify the exemplified studies to test
the activity of METH1 and/or METH2 polynucleotides (e.g., gene therapy), agonists, and/or
antagonists of METH1 and/or METH2.

Example 46: Rat Ischemic Skin Flap Model



The evaluation parameters include skin blood flow, skin temperature, and factor VIII
immunohistochemistry or endothelial alkaline phosphatase reaction. METH1 and/or
METH2 expression, during the skin ischemia, is studied using in situ hybridization.
The study in this model is divided into three parts as follows:

a) Ischemic skin

b) Ischemic skin wounds

c) Normal wounds

The experimental protocol includes:

a) Raising a 3x4 cm, single pedicle full-thickness random skin flap (myocutaneous flap over
the lower back of the animal).

b) An excisional wounding (4-6 mm in diameter) in the ischemic skin (skin-flap).

c) Topical treatment with METH1 and/or METH2 of the excisional wounds (day 0, 1, 2, 3,
4 post-wounding) at the following various dosage ranges: 1 mg to 100 mg.

d) Harvesting the wound tissues at day 3, 5, 7, 10, 14 and 21 post-wounding for histological,
immunohistochemical, and in situ studies.

The studies described in this example tested activity in METH1 and/or METH2
protein. However, one skilled in the art could easily modify the exemplified studies to test
the activity of METH1 and/or METH2 polynucleotides (e.g., gene therapy), agonists, and/or
antagonists of METH1 and/or METH2.

Example 47: Peripheral Arterial Disease Model

Angiogenic therapy using METH1 and/or METH2 is a novel therapeutic strategy to
obtain restoration of blood flow around the ischemia in case of peripheral arterial diseases.
The experimental protocol includes:

a) One side of the femoral artery is ligated to create ischemic muscle of
the hindlimb; the other side of hindlimb serves as a control.

b) METH1 and/or METH2 protein, in a dosage range of 20 mg - 500 mg, is delivered
intravenously and/or intramuscularly 3 times (perhaps more) per week for 2-3 weeks.

c) The ischemic muscle tissue is collected after ligation of the femoral artery at 1, 2,
and 3 weeks for the analysis of METH1 and/or METH2 expression and histology. Biopsy
is also performed on the other side of normal muscle of the contralateral hindlimb.

The studies described in this example tested activity in METH1 and/or METH2
protein. However, one skilled in the art could easily modify the exemplified studies to test
the activity of METH1 and/or METH2 polynucleotides (e.g., gene therapy), agonists, and/or
antagonists of METH1 and/or METH2.

Example 48: Ischemic Myocardial Disease Model

METH1 and/or METH2 is evaluated as a potent mitogen capable of stimulating the
development of collateral vessels, and restructuring new vessels after coronary artery
occlusion. Alteration of METH1 and/or METH2 expression is investigated in situ. The
experimental protocol includes:

a) The heart is exposed through a left-side thoracotomy in the rat. Immediately, the
left coronary artery is occluded with a thin suture (6-0) and the thorax is closed.

b) METH1 and/or METH2 protein, in a dosage range of 20 mg - 500 mg, is delivered
intravenously and/or intramuscularly 3 times (perhaps more) per week for 2-4 weeks.

c) Thirty days after the surgery, the heart is removed and cross-sectioned for
morphometric and in situ analyses.

The studies described in this example tested activity in METH1 and/or METH2
protein. However, one skilled in the art could easily modify the exemplified studies to test
the activity of METH1 and/or METH2 polynucleotides (e.g., gene therapy), agonists, and/or
antagonists of METH1 and/or METH2.

Example 49: Rat Corneal Wound Healing Model 

This animal model shows the effect of METH1 and/or METH2 on
neovascularization. The experimental protocol includes:

a) Making a 1-1.5 mm long incision from the center of cornea into the stromal
layer.

b) Inserting a spatula below the lip of the incision facing the outer corner of the
eye.

c) Making a pocket (its base is 1-1.5 mm from the edge of the eye).

d) Positioning a pellet, containing 50 ng - 5 ug of METH1 and/or METH2, within the
pocket.

e) METH1 and/or METH2 treatment can also be applied topically to the corneal
wounds in a dosage range of 20 mg - 500 mg (daily treatment for five days).

The studies described in this example tested activity in METH1 and/or METH2
protein. However, one skilled in the art could easily modify the exemplified studies to test
the activity of METH1 and/or METH2 polynucleotides (e.g., gene therapy), agonists, and/or
antagonists of METH1 and/or METH2.

Example 50: Diabetic Mouse and Glucocorticoid-Impaired Wound Healing Models

A. Diabetic db+/db+ Mouse Model

To demonstrate that METH1 and/or METH2 has an effect on the healing process,
the genetically diabetic mouse model of wound healing is used. The full thickness wound
healing model in the db+/db+ mouse is a well characterized, clinically relevant and
reproducible model of impaired wound healing. Healing of the diabetic wound is dependent
on formation of granulation tissue and re-epithelialization rather than contraction (Gartner,
M.H. et al., J. Surg. Res. 52:389 (1992); Greenhalgh, D.G. et al., Am. J. Pathol. 136:1235
(1990)).

The diabetic animals have many of the characteristic features observed in Type II
diabetes mellitus. Homozygous (db+/db+) mice are obese in comparison to their normal
heterozygous (db+/+m) littermates. Mutant diabetic (db+/db+) mice have a single
autosomal recessive mutation on chromosome 4 (db+) (Coleman et al., Proc. Natl. Acad. Sci.
USA 77:283-293 (1982)). Animals show polyphagia, polydipsia and polyuria. Mutant
diabetic mice (db+/db+) have elevated blood glucose, increased or normal insulin levels, and
suppressed cell-mediated immunity (Mandel et al., J. Immunol. 120:1375 (1978);
Debray-Sachs, M. et al., Clin. Exp. Immunol. 51(1):1-7 (1983); Leiter et al., Am. J. of Pathol.
114:46-55 (1985)). Peripheral neuropathy, myocardial complications, and microvascular
lesions, basement membrane thickening and glomerular filtration abnormalities have been
described in these animals (Norido, F. et al., Exp. Neurol. 83(2):221-232 (1984); Robertson
et al., Diabetes 29(1):60-67 (1980); Giacomelli et al., Lab Invest. 40(4):460-473 (1979);
Coleman, D.L., Diabetes 31 (Suppl):1-6 (1982)). These homozygous diabetic mice develop
hyperglycemia that is resistant to insulin analogous to human type II diabetes (Mandel et al.,
J. Immunol. 120:1375-1377 (1978)).

The characteristics observed in these animals suggest that healing in this model may be
similar to the healing observed in human diabetes (Greenhalgh, et al., Am. J. of Pathol.
136:1235-1246 (1990)).

Genetically diabetic female C57BL/KsJ (db+/db+) mice and their non-diabetic
(db+/+m) heterozygous littermates are used in this study (Jackson Laboratories). The
animals are purchased at 6 weeks of age and are 8 weeks old at the beginning of the study.
Animals are individually housed and receive food and water ad libitum. All manipulations
are performed using aseptic techniques. The experiments are conducted according to the
rules and guidelines of Human Genome Sciences, Inc. Institutional Animal Care and Use
Committee and the Guidelines for the Care and Use of Laboratory Animals.

Wounding protocol is performed according to previously reported methods (Tsuboi,
R. and Rifkin, D.B., J. Exp. Med. 172:245-251 (1990)). Briefly, on the day of wounding,
animals are anesthetized with an intraperitoneal injection of Avertin (0.01 mg/mL),
2,2,2-tribromoethanol and 2-methyl-2-butanol dissolved in deionized water. The dorsal region
of the animal is shaved and the skin washed with 70% ethanol solution and iodine. The
surgical area is dried with sterile gauze prior to wounding. An 8 mm full-thickness wound
is then created using a Keyes tissue punch. Immediately following wounding, the
surrounding skin is gently stretched to eliminate wound expansion. The wounds are left
open for the duration of the experiment. Application of the treatment is given topically for
5 consecutive days commencing on the day of wounding. Prior to treatment, wounds are
gently cleansed with sterile saline and gauze sponges.

Wounds are visually examined and photographed at a fixed distance at the day of
surgery and at two day intervals thereafter. Wound closure is determined by daily
measurement on days 1-5 and on day 8. Wounds are measured horizontally and vertically
using a calibrated Jameson caliper. Wounds are considered healed if granulation tissue is
no longer visible and the wound is covered by a continuous epithelium.

METH1 and/or METH2 is administered using a range of different doses of METH1
and/or METH2, from 4mg to 500mg per wound per day for 8 days in vehicle. Vehicle
control groups received 50mL of vehicle solution.

Animals are euthanized on day 8 with an intraperitoneal injection of sodium
pentobarbital (300mg/kg). The wounds and surrounding skin are then harvested for
histology and immunohistochemistry. Tissue specimens are placed in 10% neutral buffered
formalin in tissue cassettes between biopsy sponges for further processing.

Three groups of 10 animals each (5 diabetic and 5 non-diabetic controls) are
evaluated: 1) Vehicle placebo control, 2) METH1, and 3) METH2.

Wound closure is analyzed by measuring the area in the vertical and horizontal axis
and obtaining the total square area of the wound. Contraction is then estimated by
establishing the differences between the initial wound area (day 0) and that of post treatment
(day 8). The wound area on day 1 is 64 mm^2, the corresponding size of the dermal punch.
Calculations are made using the following formula:

([Open area on day 8] - [Open area on day 1]) / [Open area on day 1]
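
A minimal sketch of this calculation follows, reading the formula as the difference in open area divided by the day 1 area, so that a negative value indicates a reduction in open wound area. The function names and the day 8 caliper readings are hypothetical; the 64 mm^2 starting area is the value stated above.

```python
def wound_area(horizontal_mm, vertical_mm):
    """Square area (mm^2) from the horizontal and vertical caliper readings."""
    return horizontal_mm * vertical_mm

def fractional_area_change(area_day8, area_day1=64.0):
    """(open area day 8 - open area day 1) / open area day 1."""
    return (area_day8 - area_day1) / area_day1

# Hypothetical day 8 readings of 4 mm x 4 mm for a wound that started at 8 mm x 8 mm.
day8_area = wound_area(4.0, 4.0)
change = fractional_area_change(day8_area)  # negative means the open area shrank
```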

Specimens are fixed in 10% buffered formalin and paraffin embedded blocks are sectioned
perpendicular to the wound surface (5mm) and cut using a Reichert-Jung microtome.
Routine hematoxylin-eosin (H&E) staining is performed on cross-sections of bisected
wounds. Histologic examination of the wounds is used to assess whether the healing
process and the morphologic appearance of the repaired skin is altered by treatment with
METH1 and/or METH2. This assessment included verification of the presence of cell
accumulation, inflammatory cells, capillaries, fibroblasts, re-epithelialization and epidermal
maturity (Greenhalgh, D.G. et al., Am. J. Pathol. 136:1235 (1990)). A calibrated lens
micrometer is used by a blinded observer.

Tissue sections are also stained immunohistochemically with a polyclonal rabbit
anti-human keratin antibody using ABC Elite detection system. Human skin is used as a
positive tissue control while non-immune IgG is used as a negative control. Keratinocyte
growth is determined by evaluating the extent of reepithelialization of the wound using a
calibrated lens micrometer.

Proliferating cell nuclear antigen/cyclin (PCNA) in skin specimens is demonstrated
by using anti-PCNA antibody (1:50) with an ABC Elite detection system. Human colon
cancer served as a positive tissue control and human brain tissue is used as a negative tissue
control. Each specimen included a section with omission of the primary antibody and
substitution with non-immune mouse IgG. Ranking of these sections is based on the extent
of proliferation on a scale of 0-8, the lower side of the scale reflecting slight proliferation
to the higher side reflecting intense proliferation.

Experimental data are analyzed using an unpaired t test. A p value of < 0.05 is
considered significant.

B. Steroid Impaired Rat Model

The inhibition of wound healing by steroids has been well documented in various
in vitro and in vivo systems (Wahl, S.M. Glucocorticoids and Wound healing. In:
Anti-Inflammatory Steroid Action: Basic and Clinical Aspects. 280-302 (1989); Wahl, S.M. et
al., J. Immunol. 115: 476-481 (1975); Werb, Z. et al., J. Exp. Med. 147: 1684-1694 (1978)).
Glucocorticoids retard wound healing by inhibiting angiogenesis, decreasing vascular
permeability (Ebert, R.H., et al., An. Intern. Med. 37:701-705 (1952)), fibroblast
proliferation, and collagen synthesis (Beck, L.S. et al., Growth Factors. 5: 295-304
(1991); Haynes, B.F. et al., J. Clin. Invest. 61: 703-797 (1978)) and producing a transient
reduction of circulating monocytes (Haynes, B.F., et al., J. Clin. Invest. 61: 703-797 (1978);
Wahl, S. M., "Glucocorticoids and wound healing", In: Antiinflammatory Steroid Action:
Basic and Clinical Aspects, Academic Press, New York, pp. 280-302 (1989)). Impairment of
wound healing by the systemic administration of steroids is a well established phenomenon
in rats (Beck, L.S. et al., Growth Factors. 5: 295-304 (1991); Haynes, B.F., et al., J. Clin.
Invest. 61: 703-797 (1978); Wahl, S. M., "Glucocorticoids and wound healing", In:
Antiinflammatory Steroid Action: Basic and Clinical Aspects, Academic Press, New York,
pp. 280-302 (1989); Pierce, G.F. et al., Proc. Natl. Acad. Sci. USA 86: 2229-2233 (1989)).

To demonstrate that METH1 and/or METH2 has an effect on the healing process,
the effects of multiple topical applications of METH1 and/or METH2 on full thickness
excisional skin wounds in rats in which healing has been impaired by the systemic
administration of methylprednisolone are assessed.

Young adult male Sprague Dawley rats weighing 250-300 g (Charles River
Laboratories) are used in this example. The animals are purchased at 8 weeks of age and are
9 weeks old at the beginning of the study. The healing response of rats is impaired by the
systemic administration of methylprednisolone (17mg/kg/rat intramuscularly) at the time of
wounding. Animals are individually housed and receive food and water ad libitum. All
manipulations are performed using aseptic techniques. This study is conducted according
to the rules and guidelines of Human Genome Sciences, Inc. Institutional Animal Care and
Use Committee and the Guidelines for the Care and Use of Laboratory Animals.

The wounding protocol is followed according to section A, above. On the day of
wounding, animals are anesthetized with an intramuscular injection of ketamine (50 mg/kg)
and xylazine (5 mg/kg). The dorsal region of the animal is shaved and the skin washed with
70% ethanol and iodine solutions. The surgical area is dried with sterile gauze prior to
wounding. An 8 mm full-thickness wound is created using a Keyes tissue punch. The
wounds are left open for the duration of the experiment. Applications of the testing
materials are given topically once a day for 7 consecutive days commencing on the day of
wounding and subsequent to methylprednisolone administration. Prior to treatment, wounds
are gently cleansed with sterile saline and gauze sponges.

Wounds are visually examined and photographed at a fixed distance at the day of
wounding and at the end of treatment. Wound closure is determined by daily measurement
on days 1-5 and on day 8. Wounds are measured horizontally and vertically using a
calibrated Jameson caliper. Wounds are considered healed if granulation tissue is no longer
visible and the wound is covered by a continuous epithelium.

METH1 and/or METH2 is administered using a range of different doses of METH1
and/or METH2, from 4mg to 500mg per wound per day for 8 days in vehicle. Vehicle
control groups received 50mL of vehicle solution.

Animals are euthanized on day 8 with an intraperitoneal injection of sodium
pentobarbital (300mg/kg). The wounds and surrounding skin are then harvested for
histology. Tissue specimens are placed in 10% neutral buffered formalin in tissue cassettes
between biopsy sponges for further processing.

Four groups of 10 animals each (5 with methylprednisolone and 5 without
glucocorticoid) are evaluated: 1) Untreated group, 2) Vehicle placebo control, 3) METH1 and
4) METH2 treated groups.

Wound closure is analyzed by measuring the area in the vertical and horizontal axis
and obtaining the total area of the wound. Closure is then estimated by establishing the
differences between the initial wound area (day 0) and that of post treatment (day 8). The
wound area on day 1 is 64 mm^2, the corresponding size of the dermal punch. Calculations
are made using the following formula:

([Open area on day 8] - [Open area on day 1]) / [Open area on day 1]

Specimens are fixed in 10% buffered formalin and paraffin embedded blocks are sectioned
perpendicular to the wound surface (5mm) and cut using an Olympus microtome. Routine
hematoxylin-eosin (H&E) staining is performed on cross-sections of bisected wounds.
Histologic examination of the wounds allows assessment of whether the healing process and
the morphologic appearance of the repaired skin is improved by treatment with METH1
and/or METH2. A calibrated lens micrometer is used by a blinded observer to determine the
distance of the wound gap.

Experimental data are analyzed using an unpaired t test. A p value of < 0.05 is considered
significant.
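
The unpaired comparison between a treated group and its vehicle control can be sketched with the standard library alone. The sketch assumes the equal-variance (Student) form of the test, which the text does not specify, and uses hypothetical day 8 open wound areas; significance is judged against the two-sided 5% critical value for 8 degrees of freedom (2.306).

```python
from math import sqrt
from statistics import mean, variance

def unpaired_t(group_a, group_b):
    """Student's t statistic for two independent groups (pooled variance).

    Returns (t, degrees of freedom).
    """
    na, nb = len(group_a), len(group_b)
    pooled = ((na - 1) * variance(group_a) + (nb - 1) * variance(group_b)) / (na + nb - 2)
    se = sqrt(pooled * (1 / na + 1 / nb))
    return (mean(group_a) - mean(group_b)) / se, na + nb - 2

# Hypothetical day 8 open wound areas (mm^2), five animals per group.
vehicle_area = [40, 44, 42, 46, 38]
treated_area = [30, 34, 32, 36, 28]

t_value, dof = unpaired_t(vehicle_area, treated_area)
is_significant = abs(t_value) > 2.306  # two-sided critical value, p < 0.05, df = 8
```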

The studies described in this example tested activity in METH1 and/or METH2
protein. However, one skilled in the art could easily modify the exemplified studies to test
the activity of METH1 and/or METH2 polynucleotides (e.g., gene therapy), agonists, and/or
antagonists of METH1 and/or METH2.

Example 51: Lymphedema Animal Model

The purpose of this experimental approach is to create an appropriate and consistent
lymphedema model for testing the therapeutic effects of METH1 and/or METH2 in
lymphangiogenesis and re-establishment of the lymphatic circulatory system in the rat hind
limb. Effectiveness is measured by swelling volume of the affected limb, quantification of
the amount of lymphatic vasculature, total blood plasma protein, and histopathology. Acute
lymphedema is observed for 7-10 days. Perhaps more importantly, the chronic progress of
the edema is followed for up to 3-4 weeks.

Prior to beginning surgery, a blood sample is drawn for protein concentration analysis.
Male rats weighing approximately 350 g are dosed with Pentobarbital. Subsequently, the
right legs are shaved from knee to hip. The shaved area is swabbed with gauze soaked in
70% EtOH. Blood is drawn for serum total protein testing. Circumference and volumetric
measurements are made prior to injecting dye into paws after marking 2 measurement levels
(0.5 cm above heel, at mid-pt of dorsal paw). The intradermal dorsum of both right and left
paws are injected with 0.05 ml of 1% Evan's Blue. Circumference and volumetric
measurements are then made following injection of dye into paws.

Using the knee joint as a landmark, a mid-leg inguinal incision is made
circumferentially allowing the femoral vessels to be located. Forceps and hemostats are
used to dissect and separate the skin flaps. After locating the femoral vessels, the lymphatic
vessel that runs alongside and underneath the vessel(s) is located. The main lymphatic
vessels in this area are then electrically coagulated or suture ligated.

Using a microscope, muscles in back of the leg (near the semitendinosus and
adductors) are bluntly dissected. The popliteal lymph node is then located. The 2 proximal
and 2 distal lymphatic vessels and distal blood supply of the popliteal node are then
ligated by suturing. The popliteal lymph node, and any accompanying adipose tissue, is then
removed by cutting connective tissues.

Care is taken to control any mild bleeding resulting from this procedure. After
lymphatics are occluded, the skin flaps are sealed by using liquid skin (Vetbond) (AJ Buck).
The separated skin edges are sealed to the underlying muscle tissue while leaving a gap of
~0.5 cm around the leg. Skin also may be anchored by suturing to underlying muscle when
necessary.

To avoid infection, animals are housed individually with mesh (no bedding).
Recovering animals are checked daily through the optimal edematous peak, which typically
occurs by day 5-7. The plateau edematous peak is then observed. To evaluate the
intensity of the lymphedema, the circumference and volumes of 2 designated places on each
paw are measured before operation and daily for 7 days. The effect of plasma proteins on
lymphedema is determined, and whether protein analysis is a useful testing parameter is also
investigated. The weights of both control and edematous limbs are evaluated at 2 places.
Analysis is performed in a blind manner.

Circumference Measurements: Under brief gas anesthetic to prevent limb
movement, a cloth tape is used to measure limb circumference. Measurements are done at
the ankle bone and dorsal paw by 2 different people, and those 2 readings are then averaged.
Readings are taken from both control and edematous limbs.
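
The averaging of the two observers' readings can be sketched as below. The percent difference between the edematous and control limbs is an illustrative summary that is assumed here and is not specified in the protocol; the function names and all readings are hypothetical.

```python
def averaged_circumference(reading_1_cm, reading_2_cm):
    """Mean of the two observers' tape readings at one measurement level."""
    return (reading_1_cm + reading_2_cm) / 2

def percent_swelling(edematous_cm, control_cm):
    """Assumed summary: circumference increase relative to the control limb (%)."""
    return 100.0 * (edematous_cm - control_cm) / control_cm

# Hypothetical readings (cm) at the ankle level from two observers.
edematous = averaged_circumference(3.4, 3.6)
control = averaged_circumference(2.8, 2.8)
swelling = percent_swelling(edematous, control)
```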

Volumetric Measurements: On the day of surgery, animals are anesthetized with
Pentobarbital and are tested prior to surgery. For daily volumetrics, animals are under brief
halothane anesthetic (rapid immobilization and quick recovery), and both legs are shaved and
equally marked using waterproof marker on legs. Legs are first dipped in water, then dipped
into instrument to each marked level then measured by Buxco edema software (Chen/Victor).
Data is recorded by one person, while the other is dipping the limb to marked area.

Blood-plasma protein measurements: Blood is drawn, spun, and serum separated
prior to surgery and then at conclusion for total protein and Ca2+ comparison.

Limb Weight Comparison: After drawing blood, the animal is prepared for tissue
collection. The limbs are amputated using a guillotine, then both experimental and control
legs are cut at the ligature and weighed. A second weighing is done as the tibio-calcaneal
joint is disarticulated and the foot is weighed.

Histological Preparations: The transverse muscle located behind the knee
(popliteal) area is dissected and arranged in a metal mold, filled with freezeGel, dipped into
cold methylbutane, placed into labeled sample bags at -80°C until sectioning. Upon
sectioning, the muscle is observed under fluorescent microscopy for lymphatics.

The studies described in this example tested activity in METH1 and/or METH2
protein. However, one skilled in the art could easily modify the exemplified studies to test
the activity of METH1 and/or METH2 polynucleotides (e.g., gene therapy), agonists, and/or
antagonists of METH1 and/or METH2.

Example 52: Generation of constructs and expression of METH1

Two constructs having either a Flag peptide sequence or a human IgG1 Fc domain
fused to the full-length METH1 gene at its C-terminus were generated, using methods well
known in the art. The construct names, pFlag-CMV-5a:METH1 (ID 822) and
pC4Fc:METH1 (ID 821) were assigned.

The following primers were used for pFlag-CMV-5a:METH1:

5': AAGAATGCGGCCGCAGCCACCATGGGGAACGCGGAGCGGGCTCC
3': GATCGCGGTACCACTGCATTCTGCCATTGTGCAAAAGTCTATG

METH1 was amplified using the indicated primers, and digested with Asp718. The
vector pFLAGCMV-5a was also digested with Asp718. The resulting restriction products
were ligated together.

The following primers were used for pC4Fc:METHl: 
25 5': GATCTATGATCAGCCACCATGGGGAACGCGGAGCGGGCTCC 

3': GACTGCTCTAGAACTGCATTCTGCCATTGTGCAAAAGTCTATG 

METHl was amplified using the indicated primers, and digested with BcH and Xba. 
The vector pC4Fc was also digested with BcH and Xba. The resulting restriction products 
were ligated together. 

Constructs pA2gp:METH1(H542-Q894).Fc and pA2gp:METH1(H542-Q894) can
also be made.
Also, pC4Fc:METH1.M1-P799 can be made using the following primers:

5' primer: GATCTA TGATCA GCCACCATGGGGAACGCGGAGCGGGCTCC
3' primer: GCGTGC TCTAGA AGGGCTAAAGCTGCGAATTC

METH1 is amplified using the indicated primers, and digested with BclI and XbaI.
The vector pC4Fc is also digested with BclI and XbaI, and ligated to the digested METH1
fragment.

pFLAG-CMV-1:METH1.F236-E614 can be made using the following primers:

5' primer: GTACCC AAGCTT TTTGTGTCCAGTCACCGC
3' primer: GCGTGC TCTAGA TTACTCGTTGTGTGCTTCAC

METH1 is amplified using the indicated primers and digested with HindIII and
XbaI. The vector pFLAG-CMV-1 is also digested with HindIII and XbaI and ligated to the
digested METH1 fragment.
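The gene-specific portions of the F236-E614 primers can be checked against the construct name by translating them. The sketch below is illustrative only: it uses the standard genetic code, and the reading-frame offset applied to the reverse primer is an assumption chosen so that the in-frame stop codon is recovered. Function and variable names are hypothetical.

```python
# Standard genetic code, built from the conventional TCAG-ordered table.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

def translate(seq):
    """Translate complete codons from the start of seq ('*' marks a stop)."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

# Portions of the primers that follow the restriction sites in the text.
FWD_GENE_PART = "TTTGTGTCCAGTCACCGC"      # after AAGCTT in the 5' primer
REV_GENE_PART = "TTACTCGTTGTGTGCTTCAC"    # after TCTAGA in the 3' primer

if __name__ == "__main__":
    print(translate(FWD_GENE_PART))                          # FVSSHR
    # Assumed frame: skip 2 nt of the sense strand to reach a codon boundary.
    print(translate(reverse_complement(REV_GENE_PART)[2:]))  # EAHNE*
```

Under these assumptions the forward primer begins with a Phe codon and the reverse primer encodes a Glu codon followed directly by a stop codon, which is consistent with a fragment named F236-E614.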

The constructs were made in order to confirm the anti-angiogenesis activity of
METH1. The full length METH1 gene was PCR cloned into pC4Fc and pFlag-CMV-5a
vectors. Both pC4Fc:METH1 and pFlag-CMV-5a:METH1 were obtained and the sequence
confirmed.

Transient transfections on 293T cells were done using Lipofectamine Plus (LTI)
reagent and held for production under serum-free conditions. Western analysis was done
with either anti-huFc Ab or anti-Flag M2 Ab. METH1-Fc conditioned media showed at
least five bands with varying degrees of intensity. Their estimated MWs are 130-140
kD (weak), 110-120 kD (weak), 52 kD (strong), 45-48 kD (strong) and 32-35 kD (strongest).
Two weaker bands at about 60 and 90 kD were also detectable. METH1-Flag conditioned
medium revealed three major bands with equal intensity. They are about 100-110 kD, 70-80
kD and 22 kD.

Transient transfection of METH1-Fc in 293T cells. A second batch of
METH1-Fc protein was produced in medium with 1% serum as described above.

5.5 day conditioned medium from transiently transfected cells was run on a Protein A 
column and eluted. The fractions containing protein were examined by SDS-PAGE under 
reduced and non-reduced conditions and stained with Coomassie Blue. A second gel was 
also prepared for N-terminal sequence analysis. 

197 µg of protein were recovered, which demonstrated 4 bands under reducing
conditions. Three of the bands were strong, one was weak. N-terminal sequencing of the
bands suggested that 2 of the bands contained proteins with blocked N-termini. Of the 2
bands giving sequence, one was an Fc-derived fragment, the other a cleavage product of the
METH1-Fc fusion starting at L800 (containing two of the thrombospondin-like domains).
This suggests that METH1 is processed with at least 2 cleavage sites (possibly more) since
only the C-terminal fragments still linked to the Fc fragment would be purified on the
Protein A column.
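The size of the METH1-derived portion of the L800 cleavage product can be roughed out from the residue numbers given in this example. The sketch below is a back-of-the-envelope estimate only: the 110 Da average residue mass is a general rule of thumb, not a value from this document, and the Fc domain and any glycosylation are deliberately left out.

```python
ORF_LENGTH = 950        # residues in the METH1 ORF, as stated in this example
CLEAVAGE_START = 800    # first residue of the sequenced fragment (L800)
AVG_RESIDUE_DA = 110    # assumed average residue mass (rule of thumb)

def fragment_length(start, end):
    """Number of residues from start to end, both inclusive (1-based)."""
    return end - start + 1

n_residues = fragment_length(CLEAVAGE_START, ORF_LENGTH)
approx_kda = n_residues * AVG_RESIDUE_DA / 1000

if __name__ == "__main__":
    print(f"{n_residues} residues, roughly {approx_kda:.1f} kDa before Fc")
```

The estimate covers only the METH1 residues; the apparent size of a band on a reducing gel would also include the fused Fc domain and any post-translational modification.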

The transfected 293T cells were conditioned in medium containing 1% dialyzed, low
IgG, fetal calf serum to attempt to decrease the proteolysis of the recombinant secreted
protein. The purification and analysis were as described above. The yield of protein was
significantly higher than the first batch, possibly reflecting the effect of the serum in the
medium. While some processing may have been slowed by the serum, the majority of the
protein remained approximately 31 kD on a reducing gel.

N-terminal sequencing of resolved bands under reducing conditions indicated the
protein is processed at L800 of the 950 residue METH1 ORF, with other possible cleavage
occurring N-terminal to this site. The observed cleavage site was considered unusual since
it followed a Pro. A total of 197.4 µg of protein was isolated (HG12100 -D293T1). Analysis
of Flag protein (pFlag-CMV-5a:METH1), consisting of at least three bands on the Western
blot (120, 97 and 21 kD), indicated that only one band (21 kD) could be confirmed as
METH1 and the other bands were of non-METH1 origin.

Since sequencing of the purified METH1-Fc protein suggested an unusual cleavage
site, a second batch of METH1-Fc was prepared with cells grown in 1% FBS, to possibly
inhibit undesirable processing. A preliminary assessment of the product suggests that no
difference in processing resulted from the change in medium, but protein yields were
increased.

Functional assays of the initial Fc and Flag protein supernatants included
proliferation of Human Microvascular Endothelial Cells (HMVECs) and in vitro cord
formation using Bovine aortic endothelial cells (BAECs). The proliferation assay indicated
increased rates of HMVEC proliferation in response to both culture supernatants, which may
be attributable to high background stimulation from the conditioned medium. Cord
formation assays of both the Fc and Flag supernatants indicated inhibition of cord formation
relative to a medium/collagen control in two independent experiments.
Example 55. In vitro activity of METH1

Proliferation

HMVECs were used in an alamar blue assay to determine if METH1 supernatants
have functional anti-angiogenic activity, detectable by an inhibition of EC proliferation.
FGF-2 was used as the primary stimulus for proliferation and culture supernatants were used
at a 1:4 final dilution. The proliferation assays indicated significantly increased rates of
HMVEC proliferation in response to both culture supernatants, which may be attributable
to high background stimulation from the conditioned medium. This problem should be
reduced or eliminated by the use of purified proteins.

Cord formation

The addition of soluble type I collagen to endothelial cells and the appropriate
growth factors will induce the production of tube-like structures or cords of endothelial cells
in culture, which involves both the migration of endothelial cells and the selective deletion
(apoptosis) of cells not involved in these structures. Bovine aortic endothelial cells (BAECs)
were used to detect inhibition of stable cord formation when cultured with METH1-Fc and
METH1-Flag containing culture supernatants at a 1:4 dilution. Qualitative assessment of
the cord formation indicated inhibition with both of the tested supernatants relative to the
collagen-treated control. However, a non-matched conditioned medium control also
generated inhibition of cord formation, suggesting that non-specific cellular toxicity might
also contribute to the observed inhibition.

The studies described in this example tested activity in METH1 protein. However,
one skilled in the art could easily modify the exemplified studies to test the activity of
METH2 polypeptides, METH1 and/or METH2 polynucleotides (e.g., gene therapy),
agonists, and/or antagonists of METH1 and/or METH2.

It will be clear that the invention may be practiced otherwise than as particularly
described in the foregoing description and examples.

Numerous modifications and variations of the present invention are possible in light
of the above teachings and, therefore, are within the scope of the appended claims.

The entire disclosure of all publications (including patents, patent applications,
journal articles, laboratory manuals, books, or other documents) cited herein is hereby
incorporated by reference.
What Is Claimed Is:

1. An isolated nucleic acid molecule comprising a polynucleotide selected from
the group consisting of:

(a) a polynucleotide encoding amino acids 1 to 950 in SEQ ID NO:2;
(b) a polynucleotide encoding amino acids 2 to 950 in SEQ ID NO:2;
(c) a polynucleotide encoding amino acids 29 to 950 in SEQ ID NO:2;
(d) a polynucleotide encoding amino acids 30 to 950 in SEQ ID NO:2;
(e) a polynucleotide encoding the complete amino acid sequence encoded
by the cDNA clone contained in ATCC Deposit No. 209581;
(f) a polynucleotide encoding the mature amino acid sequence encoded
by the cDNA clone contained in ATCC Deposit No. 209581;
(g) a polynucleotide encoding amino acids 1 to 968 of SEQ ID NO:125;
(h) a polynucleotide encoding amino acids 235 to 459 in SEQ ID NO:2;
(i) a polynucleotide encoding amino acids 460 to 544 in SEQ ID NO:2;
(j) a polynucleotide encoding amino acids 545 to 598 in SEQ ID NO:2;
(k) a polynucleotide encoding amino acids 841 to 894 in SEQ ID NO:2;
(l) a polynucleotide encoding amino acids 895 to 934 in SEQ ID NO:2;
(m) a polynucleotide encoding amino acids 536 to 613 in SEQ ID NO:2;
and
(n) a polynucleotide encoding amino acids 549 to 563 in SEQ ID NO:2.

2. A method for making a recombinant vector comprising inserting an isolated 
nucleic acid molecule of claim 1 into a vector in operable linkage to a promoter. 

3. A recombinant vector produced by the method of claim 2. 

4. A method of making a recombinant host cell comprising introducing the
recombinant vector of claim 3 into a host cell.

5. A recombinant host cell produced by the method of claim 4. 
6. A recombinant method for producing a polypeptide, comprising culturing
the recombinant host cell of claim 5 under conditions such that said polypeptide is expressed
and recovering said polypeptide.

7. An isolated nucleic acid molecule, comprising 50 contiguous nucleotides of
the coding region of SEQ ID NO:1, or the complement thereof, wherein said nucleic acid
molecule does not comprise any one of SEQ ID NOs:14-41.



8. An isolated nucleic acid molecule comprising a polynucleotide selected from
the group consisting of:

(a) a polynucleotide encoding amino acids 1 to 890 in SEQ ID NO:4;
(b) a polynucleotide encoding amino acids 2 to 890 in SEQ ID NO:4;
(c) a polynucleotide encoding amino acids 24 to 890 in SEQ ID NO:4;
(d) a polynucleotide encoding amino acids 112 to 890 in SEQ ID NO:4;
(e) a polynucleotide encoding the complete amino acid sequence encoded
by the cDNA clone contained in ATCC Deposit No. 209582;
(f) a polynucleotide encoding the mature amino acid sequence encoded
by the cDNA clone contained in ATCC Deposit No. 209582;
(g) a polynucleotide encoding amino acids 214 to 439 in SEQ ID NO:4;
(h) a polynucleotide encoding amino acids 440 to 529 in SEQ ID NO:4;
(i) a polynucleotide encoding amino acids 530 to 583 in SEQ ID NO:4;
(j) a polynucleotide encoding amino acids 837 to 890 in SEQ ID NO:4;
(k) a polynucleotide encoding amino acids 280 to 606 in SEQ ID NO:4;
and
(l) a polynucleotide encoding amino acids 529 to 548 in SEQ ID NO:4.

9. A method for making a recombinant vector comprising inserting an isolated
nucleic acid molecule of claim 8 into a vector in operable linkage to a promoter.



10. A recombinant vector produced by the method of claim 9. 
11. A method of making a recombinant host cell comprising introducing the
recombinant vector of claim 10 into a host cell. 

12. A recombinant host cell produced by the method of claim 11. 

13. A recombinant method for producing a polypeptide, comprising culturing 
the recombinant host cell of claim 12 under conditions such that said polypeptide is 
expressed and recovering said polypeptide. 



14. An isolated nucleic acid molecule, comprising 50 contiguous nucleotides of
the coding region of SEQ ID NO:3, or the complement thereof, wherein said nucleic acid
molecule does not comprise any one of SEQ ID NOs:19-22, 24 or 42-77.



15. An isolated polypeptide comprising an amino acid sequence selected from
the group consisting of:

(a) amino acids 1 to 950 in SEQ ID NO:2;
(b) amino acids 2 to 950 in SEQ ID NO:2;
(c) amino acids 29 to 950 in SEQ ID NO:2;
(d) amino acids 30 to 950 in SEQ ID NO:2;
(e) the complete amino acid sequence encoded by the cDNA clone
contained in ATCC Deposit No. 209581;
(f) the mature amino acid sequence encoded by the cDNA clone
contained in ATCC Deposit No. 209581;
(g) amino acids 1 to 968 of SEQ ID NO:125;
(h) amino acids 235 to 459 in SEQ ID NO:2;
(i) amino acids 460 to 544 in SEQ ID NO:2;
(j) amino acids 545 to 598 in SEQ ID NO:2;
(k) amino acids 841 to 894 in SEQ ID NO:2;
(l) amino acids 895 to 934 in SEQ ID NO:2;
(m) amino acids 536 to 613 in SEQ ID NO:2;
and
(n) amino acids 549 to 563 in SEQ ID NO:2.
16. An isolated polypeptide comprising an amino acid sequence selected from
the group consisting of:

(a) amino acids 1 to 890 in SEQ ID NO:4;
(b) amino acids 2 to 890 in SEQ ID NO:4;
(c) amino acids 24 to 890 in SEQ ID NO:4;
(d) amino acids 112 to 890 in SEQ ID NO:4;
(e) the complete amino acid sequence encoded by the cDNA clone
contained in ATCC Deposit No. 209582;
(f) the mature amino acid sequence encoded by the cDNA clone
contained in ATCC Deposit No. 209582;
(g) amino acids 214 to 439 in SEQ ID NO:4;
(h) amino acids 440 to 529 in SEQ ID NO:4;
(i) amino acids 530 to 583 in SEQ ID NO:4;
(j) amino acids 837 to 890 in SEQ ID NO:4;
(k) amino acids 280 to 606 in SEQ ID NO:4; and
(l) amino acids 529 to 548 in SEQ ID NO:4.

17. A polypeptide comprising the amino acid sequence m-n of SEQ ID NO:2,
wherein m is an integer of 1 to 950, and wherein n is an integer of 10 to 950.

18. A polypeptide comprising the amino acid sequence m-n of SEQ ID NO:4,
wherein m is an integer of 1 to 890, and wherein n is an integer of 10 to 890.

19. A method for inhibiting angiogenesis in an individual, comprising
administering an effective amount of METH1 or METH2.



20. The method of claim 19, wherein said method is used to treat cancer, benign
tumors, an ocular angiogenic disease, rheumatoid arthritis, psoriasis, delayed wound healing,
endometriosis, vasculogenesis, granulations, hypertrophic scars, nonunion fractures,
scleroderma, trachoma, vascular adhesions, myocardial angiogenesis, coronary collaterals,
cerebral collaterals, arteriovenous malformations, ischemic limb angiogenesis,
Osler-Webber Syndrome, plaque neovascularization, telangiectasia, hemophiliac joints,
angiofibroma, fibromuscular dysplasia, wound granulation, Crohn's disease or
atherosclerosis.

21. The method of claim 19, wherein said method is used in birth control.

22. The method of claim 19, further comprising administering another angiogenic
compound.



23. The method of claim 19, wherein said METH1 or METH2 is administered
by cell or gene therapy means wherein cells have been modified to produce and secrete
METH1 or METH2.
METH1 and METH2 Polynucleotides and Polypeptides

Abstract

The present invention relates to novel anti-angiogenic proteins, related to
thrombospondin. More specifically, isolated nucleic acid molecules are provided encoding
human METH1 and METH2. METH1 and METH2 polypeptides are also provided, as are
vectors, host cells and recombinant methods for producing the same. Also provided are
diagnostic methods for the prognosis of cancer and therapeutic methods for treating
individuals in need of an increased amount of METH1 or METH2. Also provided are
methods for inhibiting angiogenesis using METH1 or METH2.




ATGGGGAACGCGG^lGCGGGCTCCGGGGTCTCGGAGCTTTGGGCCCGTACCCACGCTGCTGCTGCTCGCCGCGGCGCTA 

mgnaerap'gsrs fgpvpti»i»IiL-aaai, 
ctggccgtgtcggacgcactcgggcgcccctccgaggaggacgaggagctagtggtgccggagctggagcgcgccccg 

I/AVSDALGRPSEEDEELVVPELERap 

GGACACGGGACCACGCGCCTCCGCCTGCACGCCTTTGACCAGCAGCTGGATCTGGAGCTGCGGCCCGACAGCAGCT 
GHGTTRLRLHAFDQQLDLELRPDSSF 

TTGGCGCCCGGCTTCACGCTCCAGAACGTGGGGCGCAAATCCGGGTCCGAGACGCCGCTTCCGGTVTUICCGAC^^ 
IiAPGFTLQNVG-RKSGSETPLPET DLA 

CACTGCTTCTACTCCGGCyvCCGTGAATGGCGATCCCAGCTCGGCTGCCGCCCTCAGCCTCTGCGAGGGCGTGCGCGGC 
H. CFYS GTVNGDP SSAAALSLCEGVRG 

GCCTTCTACCTGCTGGGGGAGGCGTATTTCATCCAGCCGCTGCCCGCCGCCAGCGAGCGCCTCGCCACCGC 
AFYLLGEAYFIQPLPAASERLATAAP 

GGGGAGAAGCCGCCGGCACCACTACy\GTTCCACCTCCTGCGGCGGAATCGGGAGGGCGACGTAGGCGGa\CGTGC 
GEKP PAP IxQFHIiLRRNRQGDVGGTCG 

GTCGTGGACGACGAGCCCCGGCCGACTGGGA?AGCGGAGACCGAAG?VCGAGGACG7^GGGACTGAGGGCGAGGACGAA 
VVDDEPRPT GKAETEDEDEGTEGEDE. 

GGGCCTCAGTGGTCGCCGCAGGACCCGGCACTGCAAGGCGTAGGACAGCCCACAGGAACTGGAAGCATAAGAAAGAAG 
G PQWS PQDP ALQGVGQPTGTGSI RKK 

CGATTTGTGTCCAGTCACCGCTATGTGGAAACCATGCTTGTGGCAGACCAGTCGATGGCAGAATTCCACGGCAGTGGT 
R FV S S H R YV E TM LVADQSMAE FH 'G S G 

CTAAAGCATTACCTTCTCT^CGTTGTTTTCGGTGGCAGCCAGATTGTACAAACACCCCAGCATTCGTAATTCAGT 
IxKHYLLTLFSV AARIiYKHPSIRNSVS 

CTGGTGGTGGTGAAGATCTTGGTCATCCACGATGAACAGAAGGGGCCGGAAGTGACCTCCAATGCTGCCCTCACTCTG 
LVVVKI LVI HDEQK GPEVTSNAALTL 

CGGAACTTTTGCAACTGGCAGAAGCAGCACAACCCACCCAGTGACCGGGATGCAGAGCACTATGACACAGCAATTCT 
RNF CNVJQKQHNPPSDRDAEHYBTAIL 

TTCACCAGACAGGACTTGTGTGGGTCCCAGACATGTGATACTCTTGGGATGGCTGATGTTGGAACTGTGTGTGATCCG 
FTRQDLCGS QTCDTLGMADVGT-.-VCDP^ 

AGCAGAAGCTGCTCCGTCATAGAAGATGATGGTTTACAAGCTGCCTTCACCACAGCCCATGAATTAGGCCACGTGTTT 
S RSCSVI EDDGLQAAFTTAHELGHVF 

AACATGCCACATGATGATGCAAAGCAGTGTGCCAGCCTTAATGGTGTGAACCAGGATTCCCACATGATGGCGTCAATG 
NMPHDDAKQCASLNGVNQDSHMMASM 

CTTTCCAACCTGGACCACAGCCAGCCTTGGTCTCCTTGCAGTGCCTACATGATTACATCATTTCXGGATAATGGTCAT 
LSNLDHSQPWSPCS AYMITSFLDNGH 

GGGGAATGTTTGATGGACAAGCCTCAGAATCCCATACAGCTCCCAGGCGATCTCCCTGGCACCTCGT^ 
GECLMDKPQNPIQLPGDLPGTSYDAN 

CGGCAGTGCCAGTTTACATTTGGGGAGGACTCCAAACACTGCCCTGATGCAGCCAGCACATGTAGCACCTTGTGGTGT 
RQCQFTFGEDSKKCPDAASTCSTLWC 

ACCGGCACCTCTGGTGGGGTGCTGGTGTGTCAAACCAAACACTTCCCGTGGGCGGATGGCACCAGCTGTGGAGA^ 
TGTSGGVLVCQTKHFPWADGTSCGEG 

AAATGGTGTATCAACGGCAAGTGTGTGAACAAAACCGACAGAAAGCATTTTGATACGCCTTTTCATGGAAGCTGGGGA 
KWCIN GKCVNKTDR^KHFDTPFH'GSWG 
ATGTGGGGGCCTTGGGGAGACTGTTCGAGAACGTGCGGTGGAGGAGTCCAGTACACGATGAGGGAATGTGACAACCCA
MWGPWGDCSRTCGGGVQYTMRECDNP

GTCCCAAAGTVATGGAGGdAAGTACTGTGAAGGCAAACGAGTGCGCT^ 
VPKNGGKYCEGKRVRYRSCNLEDCPD 

AATAATGGAAAAACCTTTAGAGAGGAACAATGTGAAGCACACAACGAGTTTTC^^ 
NNGKTFREEQCEAHNEFSKASFGSGP 

GCGGTGGAATGGATTCCCAAGTACGCTGGCGTCTCACCAAAGGACAGGTGCP^ 
AVEWI PKYAGVS PKDRCKLICQAKGI 

GGCTACTTCTTCGTTTTGCAGCCC7VAGGTTGTAGATGGTACTCCATGTAGCCCAGATTCCACCT 
GYFFVLQ PKVVD GTPCSPDSTSVCVQ 

GGACAGTGTGTAAAAGCTGGTTGTGATCGCATCATAGACTCCAAAAAGAAG^^ 
GQCVKAGCDRIIDSKKKFDKCGVCGG 

AATGGATCTACTTGTAAAAAAATATCAGGATCAGTTACTAGTGCAAAACCTGGATATCATGATAT^^ 
NGSTCKKISGSVTSAKPGYHDIITIP 

ACTGGAGCCACCAACATCGAAGTGAAACAGCGGAACCAGAGGGGATCCAGGAACAATG^ 
T GAT N l EVKQRNQRGSRNNGS FLAX K 

GCTGCTGATGGCACATATATTCTTAATGGTGACTACACTTTGTCCACCTTAGAGCAAGACATTATGTACAAAGGXGTT 
AADGTYI LNGDYTLSTLEQDIMYKGV 

GTCTTGAGGTACAGCGGCTCCTCTGCGGCATTGGAAAGAATTCGCAGCTTTAGCCCTCTCAAAGAGCCCTTGACCATC 
VLRYS GS SAALERIR SFS PLKEPLTI 

CAGGTTCTTACTGTGGGCAATGCCCTTCGACCTAAAATTAAATACACCTACTTCGTAAAGAAGAAGAAGGA^ 
QVLTVGNALRPKI'KYTYFVKKKKES F 

AATGCTATCCCCACTTTTTCAGCATGGGTCATTGAAGAGTGGGGCGAATGTTCTAAGTCATGTGAATTGGGTTGGCAG 
n'ai P T.FS AWVI EEWGECS KSC ELGWQ 

AGAAGACTGGTAGAATGCCGAGACATTAATGGACAGCCTGCTTCCGAGTGTGCAAA.GGAAGTGAAGCCAGCCAGCACC 
RRLVECRDINGQPASECAKEVKPAST 

agaccttgtgcagaccatccctgcccccagtggcagctgggggagtggtcatcatgttctaagacci:gtgggaagggt 
rpcadhpcpqwqlgewsscsktcgkg 

tacaa/^aaaagaagcttgaagtgtctgtcccatgatggaggggtgttatctcatgagagctgtgatcctttaaagaaa 

YKKRS LKCLSHDGGVLSHESCDPLKK 

cctaaacatttcatagacttttgcacaatggcagaatgcagttaagtggtttaagtggtgttagctttgaggcaaggc 
pkhfi dfctmaecs- 

AAAGTGAGGAAGGGCTGGTGCAGGGA2VAGCAAGAAGGCTGGAGGGATCCAGCGTATCTTGCCAGTA?VCCAGTGAGGTG 

tatcagtaaggtgggattatgggggtagatagaaaaggagttgaatcatcagagtaaactgccagttgcaaatttgat 
aggatagttagtgaggattattaacctctgagcagtgatatagcataataaanccccgggcattattattattatttc 
ttttgttacatctattacaagtttagaaaa^acaaagcaattgtcaaaaaaaaaa^aaa^^ 
gcggccgctctagaggatccctcgaggggcccaagcttacgcgtgcatgntgtcatnagtctn 
ATGTTCCCCGCCCCCGCCGGCCCCCGGTGGCTTCCGTTCCTGCTGCTGCTGCTGCTGCTGCTGCTGCCGCTGGCCCGC
MFPAPAAP RWLP FLLLLLLLIiLFLAr 

GGCGCCCCGGCCCGGCCCGCAGCCGGGGGGCAGGCCTCGGAGCTGGTGGTGCCCACGCGGTTGCCCGGCAGCGCGGGC 
GAPARPAAGGQASEI/VVPT RLPGSAg 

gagctcgcgctccacctgtccgccttcggcaagggcttcgtgttgcgcctggcgcccgacgacagcttcctggcgccc 
elaIiHlsafgkgfvlrlapdds flap 

gagttc:aagmcgagcgcctcgggggctccggccgggcgaccgggggcga.gcgggggctgcgcggctgttttttttcc 

EFKI ERIiGGSGRATGGERGLRGCFFs 

ggcaccgtgaatggggagcccgagtcgctggcggcggtcagcctgtgccgcgggctgagcggctccttcctgctggac 
gtvngepeslaavsiicrglsgs flld 

ggcgaggagttcaccatccagccgcagggcgcggggggctccctggctcagccgcaccgcctgc^ 
geeftiqpqgaggs laqphrlqrwgp 

gccggagcccgccccctcccgcgaggacccgagtgggaggtggagacggga.gagggtcagaggcaggagagagga<^ 
agarplprgpewevetgegqrqergd 

caccaggaggacagcgaggaggagagccaagaagagg7vggcagaaggcgctagcgagccgccaccgcccctgggggcc 
hqedseeesqeeeaegaseppp plga 

acgagtaggaccaagcggtttgtgtctgaggcgcgcttcgtggagacgctgctggtggccgatgcgtccatggctgcc 
ts rt krfvs earfvetllvadasmaa 

ttctacggggccgacctgcagaaccacatcctgacgttaatgtctgtggcagcccgaatctacaagcaccccagcatc 
fygadlqnhiltlmsvaariykhp s i 

aagaattccatc:aacctgatggtggtaaaa.gtgctgatcgtagaagatgaaaaatggggccc^ 
kks i ni.mvv kvlivedekwgpevs dk 

ggggggcttacactgcgtaacttctgcaactggcagcggcgtttcaaccagcccagcgaccgccaccc^ 
ggltlrnfcnwqrrfnqpsdrhpehy 

gacacggccatcctgctcaccagacagaacttctgtgggcaggaggggctgtgtgacaccctgggtgtggcagacatc 
dtai lltrqnfcgqeglcdtlgvad I 

gggaccatttgtgaccccaaca^aagctgctccgtgatcgaggatgaggggctccaggcggcccacaccctggcccat 
gticdpnkscsviedeglqaah..tlah 

gaactagggcacgtcctcagcatgccccacgacgactccaagccctgcacacggctcttcgggcccatgggcaagcac 
elghvlsmphddskpctrlfgpmgkh 

cacgtgatggcaccgctgttcgtccacctgaaccagacgctgccctggtccccctgcagcgccatgtatctcacagag 
hvmap l f vhlhqtlpws pcsamylt e 

cttctggacggcgggcacggagactgtctcctggatgcccctggtgcggccctgcccctccccacaggcctcccgggc 
lldgghgdclldapgaalplp'tglpg 

cgcatggccctgtaccagctggaccagcagtgcaggcagatctttgggccggatttccgccactgccccaacacctct 
rmalyqldqqcrqifgpdfrhcpnt s 

gctcaggacgtctgcgcccagctttggtgccacactgatggggctgagcccctgtgccacacgaagaatggcagcctg 

AQDVCAQLWCHTDGAEPLCHTKNGS li 
CCCTGGGCTGACGGCACGCCGTGCGGGCCTGGGCACCTCTGCTCAGAAGGCAGCTGTCTACCTGAGGAGGAAGTGGAG 

pwadgtpcgpghlcsegsciipeeeve 
aggcccaagcccgtggtagatggaggctgggcaccgtggggaccctggggagaatgttctcggacctgtggaggagga 

RPKPVVDGGWAPWGPWGECSRTCGGG 
vqfshreckdpepqnggryclgrrak

taccagtcatgccacacggaggaatgcccccctgacgggaaaagcttcagggagcagcagtgtgagaagtataatgcc 
yqschteecppdgksfreqqcekyna 

tacaattacactgacatggacgggaatctcctgcagtgggtccccaagtatgctggggtgtccccccgggaccgctgc 

YNY T DMD GNIiLQWVPKYAGVS PRDRc 

aagttgttctgccgagcccgggggaggagcgagttcaaagtgttcgaggccaaggtgattgatggcaccct 
klfcrargrsefkvfeakvidgtlcg 

CC:AGA7\ACACTGGCCATCTGTGTCCGTGGCC71GTGTGTCAAGGCCGGCTGTGACCATGTGGTGGACTCG 
P ET LAI CVRGQCVKAG CDHVVDS P RK 

CTGGA€7WVTGCGGGGTGTGTGGGGGCAAAGGC7U^CTCCTGCAGGAAGGTCTCCGGGTCCCTC^^ 
LDKCGVCGGKGl^SCRKVSGSLTPTNY 

GGCTACAATGACATTGTCACCy^TCCCAGCTGGTGCCACTAATATTGACGTGA^ 

gyn d i vt i p a gat "n i d vkqrs hp gvq 

aacgatgggaactacctggcgctgaagacggctgatgggcagtacct'gctcaacggcaacctggcc^ 
ndgnylalktadgqyllngniials .ai 

gagctvggacatcttggtgaaggggaccatcctgaagtacagcggctccatcgccaccctggagcgcctgc^ 

EQDI LVKGTI LKYSGSIATLERLQ .SF 

CGGCCCTTGCCAGAGCCTCTGACAGTGCAGCTCCTGACAGTCCCTGGCGAGGTCTTCCCCCCAAAAGTCAAATACACC 
RPLP E P LTVQXiLTVPGEVFPPKVKYT 

TTCTTTGTTCCTAATGACGTGGACTTTAGCATGCAGAGCAGCAAAGAGAGAGCAACCACC^ 
FFVPNDV. DFSMQSSKERATTKIIQPIi 

CTCCACGCACAGTGGGTGCTGGGGGACTGGTCTGAGTGCTCTAGCACCTGCGGGGCCGGCTGGCAGAGGCGAACTGTA 
LHAQWVLGDWSECS STCGAGWQRRTV 

GAGTGCAGGGACCCCTCCGGCCyVGGCCTCTGCCACCTGCAACAAGGCTCTGAAACCCGAGGATGCCJ^ 
ECRD P SGQASATCNKALKPEDAKPCE 

AGCCAGCTGTGCCCCCTGTGATTCAGGGGGGCAGGGGCCAGTCTTGTGCTCCTGGACATGCGGTACTGAGGTGCAGAC 
S Q L C P L 

AAGGTCTCCACTGTGGTGACTGGGTCCCTTGGCCATATCAAGGCAGCACGGCCCACCCAGG 

CCCTCCAGTACTGCACAAATTCCTAAGGGGGAAGAGAAAAGGTATGGGGCGGCAAAACCTATCATCAACTGTCCM?TG 

NAATGGAACTTGCTCGGGTTCAATTAAAGGCM?AAGTTAAAGTAAATTCATTATGATCAACAGACCT 

TTGCANGATACAACTANTAAAAAA?y\AAAAAAAA7\AAAAAAAAA 
SEQUENCE LISTING

<110> Iruela-Arispe, Luisa 
Hastings, Gregg A. 
Ruben, Steven M. 
Jonak, Zdenka L. 
Trulli, Stephen H. 
Fronwald, James A. 
Terrett, Jonathan A. 

<120> Methl and Meth2 Polynucleotides and Polypeptides 
<130> 1488.1070006 

<140> 
<141> 

<160> 125 

<170> PatentIn Ver. 2.0

<210> 1
<211> 3261
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> (1)..(2853)

<220>
<221> UNSURE
<222> (3095)
<223> May be any nucleic acid

<220>
<221> UNSURE
<222> (3248)
<223> May be any nucleic acid

<220>
<221> UNSURE
<222> (3255)
<223> May be any nucleic acid

<220>
<221> UNSURE
<222> (3261)
<223> May be any nucleic acid

<400> 1

atg ggg aac gcg gag cgg gct ccg ggg tct cgg agc ttt ggg ccc gta 48
Met Gly Asn Ala Glu Arg Ala Pro Gly Ser Arg Ser Phe Gly Pro Val
1 5 10 15



ccc acg ctg ctg ctg ctc gcc gcg gcg cta ctg gcc gtg tcg gac gca 96
Pro Thr Leu Leu Leu Leu Ala Ala Ala Leu Leu Ala Val Ser Asp Ala
20 25 30

ctc ggg cgc ccc tcc gag gag gac gag gag cta gtg gtg ccg gag ctg 144
Leu Gly Arg Pro Ser Glu Glu Asp Glu Glu Leu Val Val Pro Glu Leu
35 40 45

gag cgc gcc ccg gga cac ggg acc acg cgc ctc cgc ctg cac gcc ttt 192
Glu Arg Ala Pro Gly His Gly Thr Thr Arg Leu Arg Leu His Ala Phe
50 55 60

gac cag cag ctg gat ctg gag ctg cgg ccc gac agc agc ttt ttg gcg 240
Asp Gln Gln Leu Asp Leu Glu Leu Arg Pro Asp Ser Ser Phe Leu Ala
65 70 75 80

ccc ggc ttc acg ctc cag aac gtg ggg cgc aaa tcc ggg tcc gag acg 288
Pro Gly Phe Thr Leu Gln Asn Val Gly Arg Lys Ser Gly Ser Glu Thr
85 90 95

ccg ctt ccg gaa acc gac ctg gcg cac tgc ttc tac tcc ggc acc gtg 336
Pro Leu Pro Glu Thr Asp Leu Ala His Cys Phe Tyr Ser Gly Thr Val
100 105 110

aat ggc gat ccc agc tcg gct gcc gcc ctc agc ctc tgc gag ggc gtg 384
Asn Gly Asp Pro Ser Ser Ala Ala Ala Leu Ser Leu Cys Glu Gly Val
115 120 125

cgc ggc gcc ttc tac ctg ctg ggg gag gcg tat ttc atc cag ccg ctg 432
Arg Gly Ala Phe Tyr Leu Leu Gly Glu Ala Tyr Phe Ile Gln Pro Leu
130 135 140

ccc gcc gcc agc gag cgc ctc gcc acc gcc gcc cca ggg gag aag ccg 480
Pro Ala Ala Ser Glu Arg Leu Ala Thr Ala Ala Pro Gly Glu Lys Pro
145 150 155 160

ccg gca cca cta cag ttc cac ctc ctg cgg cgg aat cgg cag ggc gac 528
Pro Ala Pro Leu Gln Phe His Leu Leu Arg Arg Asn Arg Gln Gly Asp
165 170 175

gta ggc ggc acg tgc ggg gtc gtg gac gac gag ccc cgg ccg act ggg 576
Val Gly Gly Thr Cys Gly Val Val Asp Asp Glu Pro Arg Pro Thr Gly
180 185 190

aaa gcg gag acc gaa gac gag gac gaa ggg act gag ggc gag gac gaa 624
Lys Ala Glu Thr Glu Asp Glu Asp Glu Gly Thr Glu Gly Glu Asp Glu
195 200 205

ggg cct cag tgg tcg ccg cag gac ccg gca ctg caa ggc gta gga cag 672
Gly Pro Gln Trp Ser Pro Gln Asp Pro Ala Leu Gln Gly Val Gly Gln
210 215 220

ccc aca gga act gga agc ata aga aag aag cga ttt gtg tcc agt cac 720
Pro Thr Gly Thr Gly Ser Ile Arg Lys Lys Arg Phe Val Ser Ser His
225 230 235 240

cgc tat gtg gaa acc atg ctt gtg gca gac cag tcg atg gca gaa ttc 768
Arg Tyr Val Glu Thr Met Leu Val Ala Asp Gln Ser Met Ala Glu Phe
245 250 255

cac ggc agt ggt cta aag cat tac ctt ctc acg ttg ttt tcg gtg gca 816
His Gly Ser Gly Leu Lys His Tyr Leu Leu Thr Leu Phe Ser Val Ala
260 265 270

gcc aga ttg tac aaa cac ccc agc att cgt aat tca gtt agc ctg gtg 864
Ala Arg Leu Tyr Lys His Pro Ser Ile Arg Asn Ser Val Ser Leu Val
275 280 285

gtg gtg aag atc ttg gtc atc cac gat gaa cag aag ggg ccg gaa gtg 912
Val Val Lys Ile Leu Val Ile His Asp Glu Gln Lys Gly Pro Glu Val
290 295 300

acc tcc aat gct gcc ctc act ctg cgg aac ttt tgc aac tgg cag aag 960
Thr Ser Asn Ala Ala Leu Thr Leu Arg Asn Phe Cys Asn Trp Gln Lys
305 310 315 320

cag cac aac cca ccc agt gac cgg gat gca gag cac tat gac aca gca 1008
Gln His Asn Pro Pro Ser Asp Arg Asp Ala Glu His Tyr Asp Thr Ala
325 330 335

att ctt ttc acc aga cag gac ttg tgt ggg tcc cag aca tgt gat act 1056
Ile Leu Phe Thr Arg Gln Asp Leu Cys Gly Ser Gln Thr Cys Asp Thr
340 345 350

ctt ggg atg gct gat gtt gga act gtg tgt gat ccg agc aga agc tgc 1104
Leu Gly Met Ala Asp Val Gly Thr Val Cys Asp Pro Ser Arg Ser Cys
355 360 365

tcc gtc ata gaa gat gat ggt tta caa gct gcc ttc acc aca gcc cat 1152
Ser Val Ile Glu Asp Asp Gly Leu Gln Ala Ala Phe Thr Thr Ala His
370 375 380

gaa tta ggc cac gtg ttt aac atg cca cat gat gat gca aag cag tgt 1200
Glu Leu Gly His Val Phe Asn Met Pro His Asp Asp Ala Lys Gln Cys
385 390 395 400

gcc agc ctt aat ggt gtg aac cag gat tcc cac atg atg gcg tca atg 1248
Ala Ser Leu Asn Gly Val Asn Gln Asp Ser His Met Met Ala Ser Met
405 410 415

ctt tcc aac ctg gac cac agc cag cct tgg tct cct tgc agt gcc tac 1296
Leu Ser Asn Leu Asp His Ser Gln Pro Trp Ser Pro Cys Ser Ala Tyr
420 425 430

atg att aca tca ttt ctg gat aat ggt cat ggg gaa tgt ttg atg gac 1344
Met Ile Thr Ser Phe Leu Asp Asn Gly His Gly Glu Cys Leu Met Asp
435 440 445

aag cct cag aat ccc ata cag ctc cca ggc gat ctc cct ggc acc tcg 1392
Lys Pro Gln Asn Pro Ile Gln Leu Pro Gly Asp Leu Pro Gly Thr Ser
450 455 460
450 455 460 
tac gat gcc aac cgg cag tgc cag ttt aca ttt ggg gag gac tcc aaa 1440
Tyr Asp Ala Asn Arg Gln Cys Gln Phe Thr Phe Gly Glu Asp Ser Lys
465 470 475 480

cac tgc cct gat gca gcc agc aca tgt agc acc ttg tgg tgt acc ggc 1488
His Cys Pro Asp Ala Ala Ser Thr Cys Ser Thr Leu Trp Cys Thr Gly
485 490 495

acc tct ggt ggg gtg ctg gtg tgt caa acc aaa cac ttc ccg tgg gcg 1536
Thr Ser Gly Gly Val Leu Val Cys Gln Thr Lys His Phe Pro Trp Ala
500 505 510

gat ggc acc agc tgt gga gaa ggg aaa tgg tgt atc aac ggc aag tgt 1584
Asp Gly Thr Ser Cys Gly Glu Gly Lys Trp Cys Ile Asn Gly Lys Cys
515 520 525

gtg aac aaa acc gac aga aag cat ttt gat acg cct ttt cat gga agc 1632
Val Asn Lys Thr Asp Arg Lys His Phe Asp Thr Pro Phe His Gly Ser
530 535 540

tgg gga atg tgg ggg cct tgg gga gac tgt tcg aga acg tgc ggt gga 1680
Trp Gly Met Trp Gly Pro Trp Gly Asp Cys Ser Arg Thr Cys Gly Gly
545 550 555 560

gga gtc cag tac acg atg agg gaa tgt gac aac cca gtc cca aag aat 1728
Gly Val Gln Tyr Thr Met Arg Glu Cys Asp Asn Pro Val Pro Lys Asn
565 570 575

gga ggg aag tac tgt gaa ggc aaa cga gtg cgc tac aga tcc tgt aac 1776
Gly Gly Lys Tyr Cys Glu Gly Lys Arg Val Arg Tyr Arg Ser Cys Asn
580 585 590

ctt gag gac tgt cca gac aat aat gga aaa acc ttt aga gag gaa caa 1824
Leu Glu Asp Cys Pro Asp Asn Asn Gly Lys Thr Phe Arg Glu Glu Gln
595 600 605

tgt gaa gca cac aac gag ttt tca aaa gct tcc ttt ggg agt ggg cct 1872
Cys Glu Ala His Asn Glu Phe Ser Lys Ala Ser Phe Gly Ser Gly Pro
610 615 620
gcg gtg gaa tgg att ccc aag tac gct ggc gtc tca cca aag gac agg 1920
Ala Val Glu Trp Ile Pro Lys Tyr Ala Gly Val Ser Pro Lys Asp Arg
625 630 635 640

tgc aag ctc atc tgc caa gcc aaa ggc att ggc tac ttc ttc gtt ttg 1968
Cys Lys Leu Ile Cys Gln Ala Lys Gly Ile Gly Tyr Phe Phe Val Leu
645 650 655

cag ccc aag gtt gta gat ggt act cca tgt agc cca gat tcc acc tct 2016
Gln Pro Lys Val Val Asp Gly Thr Pro Cys Ser Pro Asp Ser Thr Ser
660 665 670

gtc tgt gtg caa gga cag tgt gta aaa gct ggt tgt gat cgc atc ata 2064
Val Cys Val Gln Gly Gln Cys Val Lys Ala Gly Cys Asp Arg Ile Ile
675 680 685

gac tcc aaa aag aag ttt gat aaa tgt ggt gtt tgc ggg gga aat gga 2112
Asp Ser Lys Lys Lys Phe Asp Lys Cys Gly Val Cys Gly Gly Asn Gly
690 695 700

tct act tgt aaa aaa ata tca gga tca gtt act agt gca aaa cct gga 2160
Ser Thr Cys Lys Lys Ile Ser Gly Ser Val Thr Ser Ala Lys Pro Gly
705 710 715 720

tat cat gat atc atc aca att cca act gga gcc acc aac atc gaa gtg 2208
Tyr His Asp Ile Ile Thr Ile Pro Thr Gly Ala Thr Asn Ile Glu Val
725 730 735

aaa cag cgg aac cag agg gga tcc agg aac aat ggc agc ttt ctt gcc 2256
Lys Gln Arg Asn Gln Arg Gly Ser Arg Asn Asn Gly Ser Phe Leu Ala
740 745 750

atc aaa gct gct gat ggc aca tat att ctt aat ggt gac tac act ttg 2304
Ile Lys Ala Ala Asp Gly Thr Tyr Ile Leu Asn Gly Asp Tyr Thr Leu
755 760 765

tcc acc tta gag caa gac att atg tac aaa ggt gtt gtc ttg agg tac 2352
Ser Thr Leu Glu Gln Asp Ile Met Tyr Lys Gly Val Val Leu Arg Tyr
770 775 780

agc ggc tcc tct gcg gca ttg gaa aga att cgc agc ttt agc cct ctc 2400
Ser Gly Ser Ser Ala Ala Leu Glu Arg Ile Arg Ser Phe Ser Pro Leu
785 790 795 800

aaa gag ccc ttg acc atc cag gtt ctt act gtg ggc aat gcc ctt cga 2448
Lys Glu Pro Leu Thr Ile Gln Val Leu Thr Val Gly Asn Ala Leu Arg
805 810 815

cct aaa att aaa tac acc tac ttc gta aag aag aag aag gaa tct ttc 2496
Pro Lys Ile Lys Tyr Thr Tyr Phe Val Lys Lys Lys Lys Glu Ser Phe
820 825 830

aat gct atc ccc act ttt tca gca tgg gtc att gaa gag tgg ggc gaa 2544
Asn Ala Ile Pro Thr Phe Ser Ala Trp Val Ile Glu Glu Trp Gly Glu
835 840 845

tgt tct aag tca tgt gaa ttg ggt tgg cag aga aga ctg gta gaa tgc 2592
Cys Ser Lys Ser Cys Glu Leu Gly Trp Gln Arg Arg Leu Val Glu Cys
850 855 860

cga gac att aat gga cag cct gct tcc gag tgt gca aag gaa gtg aag 2640
Arg Asp Ile Asn Gly Gln Pro Ala Ser Glu Cys Ala Lys Glu Val Lys
865 870 875 880

cca gcc agc acc aga cct tgt gca gac cat ccc tgc ccc cag tgg cag 2688
Pro Ala Ser Thr Arg Pro Cys Ala Asp His Pro Cys Pro Gln Trp Gln
885 890 895

ctg ggg gag tgg tca tca tgt tct aag acc tgt ggg aag ggt tac aaa 2736
Leu Gly Glu Trp Ser Ser Cys Ser Lys Thr Cys Gly Lys Gly Tyr Lys
900 905 910

aaa aga agc ttg aag tgt ctg tcc cat gat gga ggg gtg tta tct cat 2784
Lys Arg Ser Leu Lys Cys Leu Ser His Asp Gly Gly Val Leu Ser His
915 920 925

gag agc tgt gat cct tta aag aaa cct aaa cat ttc ata gac ttt tgc 2832
Glu Ser Cys Asp Pro Leu Lys Lys Pro Lys His Phe Ile Asp Phe Cys
930 935 940



aca atg gca gaa tgc agt taa gtggtttaag tggtgttagc tttgaggcaa 2883
Thr Met Ala Glu Cys Ser
945 950

ggcaaagtga ggaagggctg gtgcagggaa agcaagaagg ctggagggat ccagcgtatc 2943
ttgccagtaa ccagtgaggt gtatcagtaa ggtgggatta tgggggtaga tagaaaagga 3003
gttgaatcat cagagtaaac tgccagttgc aaatttgata ggatagtbag tgaggattat 3063
taacctctga gcagtgatat agcataataa anccccgggc attattatta ttatttcttt 3123
tgttacatct attacaagtt tagaaaaaac aaagcaattg tcaaaaaaaa aaaaaaaaaa 3183
aaaaaaaaaa aaagggcggc cgctctagag gatccctcga ggggcccaag cttacgcgtg 3243
catgntgtca tnagtctn 3261



<210> 2 
<211> 950 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met Gly Asn Ala Glu Arg Ala Pro Gly Ser Arg Ser Phe Gly Pro Val
1 5 10 15

Pro Thr Leu Leu Leu Leu Ala Ala Ala Leu Leu Ala Val Ser Asp Ala
20 25 30

Leu Gly Arg Pro Ser Glu Glu Asp Glu Glu Leu Val Val Pro Glu Leu
35 40 45

Glu Arg Ala Pro Gly His Gly Thr Thr Arg Leu Arg Leu His Ala Phe
50 55 60

Asp Gln Gln Leu Asp Leu Glu Leu Arg Pro Asp Ser Ser Phe Leu Ala
65 70 75 80

Pro Gly Phe Thr Leu Gln Asn Val Gly Arg Lys Ser Gly Ser Glu Thr
85 90 95

Pro Leu Pro Glu Thr Asp Leu Ala His Cys Phe Tyr Ser Gly Thr Val
100 105 110

Asn Gly Asp Pro Ser Ser Ala Ala Ala Leu Ser Leu Cys Glu Gly Val
115 120 125

Arg Gly Ala Phe Tyr Leu Leu Gly Glu Ala Tyr Phe Ile Gln Pro Leu
130 135 140

Pro Ala Ala Ser Glu Arg Leu Ala Thr Ala Ala Pro Gly Glu Lys Pro
145 150 155 160

Pro Ala Pro Leu Gln Phe His Leu Leu Arg Arg Asn Arg Gln Gly Asp
165 170 175

Val Gly Gly Thr Cys Gly Val Val Asp Asp Glu Pro Arg Pro Thr Gly
180 185 190

Lys Ala Glu Thr Glu Asp Glu Asp Glu Gly Thr Glu Gly Glu Asp Glu
195 200 205

Gly Pro Gln Trp Ser Pro Gln Asp Pro Ala Leu Gln Gly Val Gly Gln
210 215 220

Pro Thr Gly Thr Gly Ser Ile Arg Lys Lys Arg Phe Val Ser Ser His
225 230 235 240

Arg Tyr Val Glu Thr Met Leu Val Ala Asp Gln Ser Met Ala Glu Phe
245 250 255

His Gly Ser Gly Leu Lys His Tyr Leu Leu Thr Leu Phe Ser Val Ala
260 265 270

Ala Arg Leu Tyr Lys His Pro Ser Ile Arg Asn Ser Val Ser Leu Val
275 280 285

Val Val Lys Ile Leu Val Ile His Asp Glu Gln Lys Gly Pro Glu Val
290 295 300

Thr Ser Asn Ala Ala Leu Thr Leu Arg Asn Phe Cys Asn Trp Gln Lys
305 310 315 320

Gln His Asn Pro Pro Ser Asp Arg Asp Ala Glu His Tyr Asp Thr Ala
325 330 335

Ile Leu Phe Thr Arg Gln Asp Leu Cys Gly Ser Gln Thr Cys Asp Thr
340 345 350

Leu Gly Met Ala Asp Val Gly Thr Val Cys Asp Pro Ser Arg Ser Cys
355 360 365

Ser Val Ile Glu Asp Asp Gly Leu Gln Ala Ala Phe Thr Thr Ala His
370 375 380

Glu Leu Gly His Val Phe Asn Met Pro His Asp Asp Ala Lys Gln Cys
385 390 395 400

Ala Ser Leu Asn Gly Val Asn Gln Asp Ser His Met Met Ala Ser Met
405 410 415

Leu Ser Asn Leu Asp His Ser Gln Pro Trp Ser Pro Cys Ser Ala Tyr
420 425 430

Met Ile Thr Ser Phe Leu Asp Asn Gly His Gly Glu Cys Leu Met Asp
435 440 445

Lys Pro Gln Asn Pro Ile Gln Leu Pro Gly Asp Leu Pro Gly Thr Ser
450 455 460

Tyr Asp Ala Asn Arg Gln Cys Gln Phe Thr Phe Gly Glu Asp Ser Lys
465 470 475 480

His Cys Pro Asp Ala Ala Ser Thr Cys Ser Thr Leu Trp Cys Thr Gly
485 490 495

Thr Ser Gly Gly Val Leu Val Cys Gln Thr Lys His Phe Pro Trp Ala
500 505 510

Asp Gly Thr Ser Cys Gly Glu Gly Lys Trp Cys Ile Asn Gly Lys Cys
515 520 525

Val Asn Lys Thr Asp Arg Lys His Phe Asp Thr Pro Phe His Gly Ser
530 535 540

Trp Gly Met Trp Gly Pro Trp Gly Asp Cys Ser Arg Thr Cys Gly Gly
545 550 555 560

Gly Val Gln Tyr Thr Met Arg Glu Cys Asp Asn Pro Val Pro Lys Asn
565 570 575

Gly Gly Lys Tyr Cys Glu Gly Lys Arg Val Arg Tyr Arg Ser Cys Asn
580 585 590

Leu Glu Asp Cys Pro Asp Asn Asn Gly Lys Thr Phe Arg Glu Glu Gln
595 600 605

Cys Glu Ala His Asn Glu Phe Ser Lys Ala Ser Phe Gly Ser Gly Pro
610 615 620

Ala Val Glu Trp Ile Pro Lys Tyr Ala Gly Val Ser Pro Lys Asp Arg
625 630 635 640

Cys Lys Leu Ile Cys Gln Ala Lys Gly Ile Gly Tyr Phe Phe Val Leu
645 650 655

Gln Pro Lys Val Val Asp Gly Thr Pro Cys Ser Pro Asp Ser Thr Ser
660 665 670

Val Cys Val Gln Gly Gln Cys Val Lys Ala Gly Cys Asp Arg Ile Ile
675 680 685

Asp Ser Lys Lys Lys Phe Asp Lys Cys Gly Val Cys Gly Gly Asn Gly
690 695 700

Ser Thr Cys Lys Lys Ile Ser Gly Ser Val Thr Ser Ala Lys Pro Gly
705 710 715 720

Tyr His Asp Ile Ile Thr Ile Pro Thr Gly Ala Thr Asn Ile Glu Val
725 730 735

Lys Gln Arg Asn Gln Arg Gly Ser Arg Asn Asn Gly Ser Phe Leu Ala
740 745 750

Ile Lys Ala Ala Asp Gly Thr Tyr Ile Leu Asn Gly Asp Tyr Thr Leu
755 760 765

Ser Thr Leu Glu Gln Asp Ile Met Tyr Lys Gly Val Val Leu Arg Tyr
770 775 780

Ser Gly Ser Ser Ala Ala Leu Glu Arg Ile Arg Ser Phe Ser Pro Leu
785 790 795 800

Lys Glu Pro Leu Thr Ile Gln Val Leu Thr Val Gly Asn Ala Leu Arg
805 810 815

Pro Lys Ile Lys Tyr Thr Tyr Phe Val Lys Lys Lys Lys Glu Ser Phe
820 825 830

Asn Ala Ile Pro Thr Phe Ser Ala Trp Val Ile Glu Glu Trp Gly Glu
835 840 845

Cys Ser Lys Ser Cys Glu Leu Gly Trp Gln Arg Arg Leu Val Glu Cys
850 855 860

Arg Asp Ile Asn Gly Gln Pro Ala Ser Glu Cys Ala Lys Glu Val Lys
865 870 875 880

Pro Ala Ser Thr Arg Pro Cys Ala Asp His Pro Cys Pro Gln Trp Gln
885 890 895

Leu Gly Glu Trp Ser Ser Cys Ser Lys Thr Cys Gly Lys Gly Tyr Lys
900 905 910

Lys Arg Ser Leu Lys Cys Leu Ser His Asp Gly Gly Val Leu Ser His
915 920 925

Glu Ser Cys Asp Pro Leu Lys Lys Pro Lys His Phe Ile Asp Phe Cys
930 935 940

Thr Met Ala Glu Cys Ser
945 950



<210> 3 
<211> 3008

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1)..(2670)

<220> 

<221> UNSURE 
<222> (2887) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (2957) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (2970) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (2981) 

<223> May be any nucleic acid 
<400> 3 

atg ttc ccc gcc ccc gcc gcc ccc cgg tgg ctt ccg ttc ctg ctg ctg 48
Met Phe Pro Ala Pro Ala Ala Pro Arg Trp Leu Pro Phe Leu Leu Leu
1 5 10 15

ctg ctg ctg ctg ctg ctg ccg ctg gcc cgc ggc gcc ccg gcc cgg ccc 96
Leu Leu Leu Leu Leu Leu Pro Leu Ala Arg Gly Ala Pro Ala Arg Pro
20 25 30

gca gcc ggg ggg cag gcc tcg gag ctg gtg gtg ccc acg cgg ttg ccc 144
Ala Ala Gly Gly Gln Ala Ser Glu Leu Val Val Pro Thr Arg Leu Pro
35 40 45

ggc agc gcg ggc gag ctc gcg ctc cac ctg tcc gcc ttc ggc aag ggc 192
Gly Ser Ala Gly Glu Leu Ala Leu His Leu Ser Ala Phe Gly Lys Gly
50 55 60

ttc gtg ttg cgc ctg gcg ccc gac gac agc ttc ctg gcg ccc gag ttc 240
Phe Val Leu Arg Leu Ala Pro Asp Asp Ser Phe Leu Ala Pro Glu Phe
65 70 75 80

aag atc gag cgc ctc ggg ggc tcc ggc cgg gcg acc ggg ggc gag cgg 288
Lys Ile Glu Arg Leu Gly Gly Ser Gly Arg Ala Thr Gly Gly Glu Arg
85 90 95

ggg ctg cgc ggc tgt ttt ttt tcc ggc acc gtg aat ggg gag ccc gag 336
Gly Leu Arg Gly Cys Phe Phe Ser Gly Thr Val Asn Gly Glu Pro Glu
100 105 110

tcg ctg gcg gcg gtc agc ctg tgc cgc ggg ctg agc ggc tcc ttc ctg 384
Ser Leu Ala Ala Val Ser Leu Cys Arg Gly Leu Ser Gly Ser Phe Leu
115 120 125

ctg gac ggc gag gag ttc acc atc cag ccg cag ggc gcg ggg ggc tcc 432
Leu Asp Gly Glu Glu Phe Thr Ile Gln Pro Gln Gly Ala Gly Gly Ser
130 135 140

ctg gct cag ccg cac cgc ctg cag cgc tgg ggt ccc gcc gga gcc cgc 480
Leu Ala Gln Pro His Arg Leu Gln Arg Trp Gly Pro Ala Gly Ala Arg
145 150 155 160

ccc ctc ccg cga gga ccc gag tgg gag gtg gag acg gga gag ggt cag 528
Pro Leu Pro Arg Gly Pro Glu Trp Glu Val Glu Thr Gly Glu Gly Gln
165 170 175

agg cag gag aga gga gac cac cag gag gac agc gag gag gag agc caa 576
Arg Gln Glu Arg Gly Asp His Gln Glu Asp Ser Glu Glu Glu Ser Gln
180 185 190

gaa gag gag gca gaa ggc gct agc gag ccg cca ccg ccc ctg ggg gcc 624
Glu Glu Glu Ala Glu Gly Ala Ser Glu Pro Pro Pro Pro Leu Gly Ala
195 200 205

acg agt agg acc aag cgg ttt gtg tct gag gcg cgc ttc gtg gag acg 672
Thr Ser Arg Thr Lys Arg Phe Val Ser Glu Ala Arg Phe Val Glu Thr
210 215 220

ctg ctg gtg gcc gat gcg tcc atg gct gcc ttc tac ggg gcc gac ctg 720
Leu Leu Val Ala Asp Ala Ser Met Ala Ala Phe Tyr Gly Ala Asp Leu
225 230 235 240

cag aac cac atc ctg acg tta atg tct gtg gca gcc cga atc tac aag 768
Gln Asn His Ile Leu Thr Leu Met Ser Val Ala Ala Arg Ile Tyr Lys
245 250 255

cac ccc agc atc aag aat tcc atc aac ctg atg gtg gta aaa gtg ctg 816
His Pro Ser Ile Lys Asn Ser Ile Asn Leu Met Val Val Lys Val Leu
260 265 270

atc gta gaa gat gaa aaa tgg ggc cca gag gtg tcc gac aat ggg ggg 864
Ile Val Glu Asp Glu Lys Trp Gly Pro Glu Val Ser Asp Asn Gly Gly
275 280 285

ctt aca ctg cgt aac ttc tgc aac tgg cag cgg cgt ttc aac cag ccc 912
Leu Thr Leu Arg Asn Phe Cys Asn Trp Gln Arg Arg Phe Asn Gln Pro
290 295 300

agc gac cgc cac cca gag cac tac gac acg gcc atc ctg ctc acc aga 960
Ser Asp Arg His Pro Glu His Tyr Asp Thr Ala Ile Leu Leu Thr Arg
305 310 315 320

cag aac ttc tgt ggg cag gag ggg ctg tgt gac acc ctg ggt gtg gca 1008
Gln Asn Phe Cys Gly Gln Glu Gly Leu Cys Asp Thr Leu Gly Val Ala
325 330 335

gac atc ggg acc att tgt gac ccc aac aaa agc tgc tcc gtg atc gag 1056
Asp Ile Gly Thr Ile Cys Asp Pro Asn Lys Ser Cys Ser Val Ile Glu
340 345 350

gat gag ggg ctc cag gcg gcc cac acc ctg gcc cat gaa cta ggg cac 1104
Asp Glu Gly Leu Gln Ala Ala His Thr Leu Ala His Glu Leu Gly His
355 360 365

gtc ctc agc atg ccc cac gac gac tcc aag ccc tgc aca cgg ctc ttc 1152
Val Leu Ser Met Pro His Asp Asp Ser Lys Pro Cys Thr Arg Leu Phe
370 375 380



ggg ccc atg ggc aag cac cac gtg atg gca ccg ctg ttc gtc cac ctg 1200
Gly Pro Met Gly Lys His His Val Met Ala Pro Leu Phe Val His Leu
385 390 395 400

aac cag acg ctg ccc tgg tcc ccc tgc agc gcc atg tat ctc aca gag 1248
Asn Gln Thr Leu Pro Trp Ser Pro Cys Ser Ala Met Tyr Leu Thr Glu
405 410 415

ctt ctg gac ggc ggg cac gga gac tgt ctc ctg gat gcc cct ggt gcg 1296
Leu Leu Asp Gly Gly His Gly Asp Cys Leu Leu Asp Ala Pro Gly Ala
420 425 430

gcc ctg ccc ctc ccc aca ggc ctc ccg ggc cgc atg gcc ctg tac cag 1344
Ala Leu Pro Leu Pro Thr Gly Leu Pro Gly Arg Met Ala Leu Tyr Gln
435 440 445

ctg gac cag cag tgc agg cag atc ttt ggg ccg gat ttc cgc cac tgc 1392
Leu Asp Gln Gln Cys Arg Gln Ile Phe Gly Pro Asp Phe Arg His Cys
450 455 460

ccc aac acc tct gct cag gac gtc tgc gcc cag ctt tgg tgc cac act 1440
Pro Asn Thr Ser Ala Gln Asp Val Cys Ala Gln Leu Trp Cys His Thr
465 470 475 480

gat ggg gct gag ccc ctg tgc cac acg aag aat ggc agc ctg ccc tgg 1488
Asp Gly Ala Glu Pro Leu Cys His Thr Lys Asn Gly Ser Leu Pro Trp
485 490 495

gct gac ggc acg ccg tgc ggg cct ggg cac ctc tgc tca gaa ggc agc 1536
Ala Asp Gly Thr Pro Cys Gly Pro Gly His Leu Cys Ser Glu Gly Ser
500 505 510

tgt cta cct gag gag gaa gtg gag agg ccc aag ccc gtg gta gat gga 1584
Cys Leu Pro Glu Glu Glu Val Glu Arg Pro Lys Pro Val Val Asp Gly
515 520 525

ggc tgg gca ccg tgg gga ccc tgg gga gaa tgt tct cgg acc tgt gga 1632
Gly Trp Ala Pro Trp Gly Pro Trp Gly Glu Cys Ser Arg Thr Cys Gly
530 535 540

gga gga gta cag ttt tca cac cgt gag tgc aag gac ccc gag cct cag 1680
Gly Gly Val Gln Phe Ser His Arg Glu Cys Lys Asp Pro Glu Pro Gln
545 550 555 560

aat gga gga aga tac tgc ctg ggt cgg aga gcc aag tac cag tca tgc 1728
Asn Gly Gly Arg Tyr Cys Leu Gly Arg Arg Ala Lys Tyr Gln Ser Cys
565 570 575

cac acg gag gaa tgc ccc cct gac ggg aaa agc ttc agg gag cag cag 1776
His Thr Glu Glu Cys Pro Pro Asp Gly Lys Ser Phe Arg Glu Gln Gln
580 585 590

tgt gag aag tat aat gcc tac aat tac act gac atg gac ggg aat ctc 1824
Cys Glu Lys Tyr Asn Ala Tyr Asn Tyr Thr Asp Met Asp Gly Asn Leu
595 600 605

ctg cag tgg gtc ccc aag tat gct ggg gtg tcc ccc cgg gac cgc tgc 1872
Leu Gln Trp Val Pro Lys Tyr Ala Gly Val Ser Pro Arg Asp Arg Cys
610 615 620

aag ttg ttc tgc cga gcc cgg ggg agg agc gag ttc aaa gtg ttc gag 1920
Lys Leu Phe Cys Arg Ala Arg Gly Arg Ser Glu Phe Lys Val Phe Glu
625 630 635 640

gcc aag gtg att gat ggc acc ctg tgt ggg cca gaa aca ctg gcc atc 1968
Ala Lys Val Ile Asp Gly Thr Leu Cys Gly Pro Glu Thr Leu Ala Ile
645 650 655

tgt gtc cgt ggc cag tgt gtc aag gcc ggc tgt gac cat gtg gtg gac 2016
Cys Val Arg Gly Gln Cys Val Lys Ala Gly Cys Asp His Val Val Asp
660 665 670

tcg cct cgg aag ctg gac aaa tgc ggg gtg tgt ggg ggc aaa ggc aac 2064
Ser Pro Arg Lys Leu Asp Lys Cys Gly Val Cys Gly Gly Lys Gly Asn
675 680 685

tcc tgc agg aag gtc tcc ggg tcc ctc acc ccc acc aat tat ggc tac 2112
Ser Cys Arg Lys Val Ser Gly Ser Leu Thr Pro Thr Asn Tyr Gly Tyr
690 695 700

aat gac att gtc acc atc cca gct ggt gcc act aat att gac gtg aag 2160
Asn Asp Ile Val Thr Ile Pro Ala Gly Ala Thr Asn Ile Asp Val Lys
705 710 715 720

cag cgg agc cac ccg ggt gtg cag aac gat ggg aac tac ctg gcg ctg 2208
Gln Arg Ser His Pro Gly Val Gln Asn Asp Gly Asn Tyr Leu Ala Leu
725 730 735

aag acg gct gat ggg cag tac ctg ctc aac ggc aac ctg gcc atc tct 2256
Lys Thr Ala Asp Gly Gln Tyr Leu Leu Asn Gly Asn Leu Ala Ile Ser
740 745 750

gcc ata gag cag gac atc ttg gtg aag ggg acc atc ctg aag tac agc 2304
Ala Ile Glu Gln Asp Ile Leu Val Lys Gly Thr Ile Leu Lys Tyr Ser
755 760 765

ggc tcc atc gcc acc ctg gag cgc ctg cag agc ttc cgg ccc ttg cca 2352
Gly Ser Ile Ala Thr Leu Glu Arg Leu Gln Ser Phe Arg Pro Leu Pro
770 775 780

gag cct ctg aca gtg cag ctc ctg aca gtc cct ggc gag gtc ttc ccc 2400
Glu Pro Leu Thr Val Gln Leu Leu Thr Val Pro Gly Glu Val Phe Pro
785 790 795 800

cca aaa gtc aaa tac acc ttc ttt gtt cct aat gac gtg gac ttt agc 2448
Pro Lys Val Lys Tyr Thr Phe Phe Val Pro Asn Asp Val Asp Phe Ser
805 810 815

atg cag agc agc aaa gag aga gca acc acc aac atc atc cag ccg ctg 2496
Met Gln Ser Ser Lys Glu Arg Ala Thr Thr Asn Ile Ile Gln Pro Leu
820 825 830

ctc cac gca cag tgg gtg ctg ggg gac tgg tct gag tgc tct agc acc 2544
Leu His Ala Gln Trp Val Leu Gly Asp Trp Ser Glu Cys Ser Ser Thr
835 840 845

tgc ggg gcc ggc tgg cag agg cga act gta gag tgc agg gac ccc tcc 2592
Cys Gly Ala Gly Trp Gln Arg Arg Thr Val Glu Cys Arg Asp Pro Ser
850 855 860

ggc cag gcc tct gcc acc tgc aac aag gct ctg aaa ccc gag gat gcc 2640
Gly Gln Ala Ser Ala Thr Cys Asn Lys Ala Leu Lys Pro Glu Asp Ala
865 870 875 880

aag ccc tgc gaa agc cag ctg tgc ccc ctg tgattcaggg gggcaggggc 2690
Lys Pro Cys Glu Ser Gln Leu Cys Pro Leu
885 890

cagtcttgtg ctcctggaca tgcggtactg aggtgcagac aaggtctcca ctgtggtgac 2750 

tgggtccctt ggccatatca aggcagcacg gcccacccag gcctcccatt gccgcaaccc 2810 

ctccagtact gcacaaattc ctaaggggga agagaaaagg tatggggcgg caaaacctat 2870 

catcaactgt ccawtgnaat ggaacttgct cgggttcaat taaaggcata agttaaagta 2930 

aattcattat gatcaacaga cctcacntca tctgttgcan gatacaacta ntaaaaaaaa 2990 

aaaaaaaaaa aaaaaaaa 3008 

<210> 4 
<211> 890 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Met Phe Pro Ala Pro Ala Ala Pro Arg Trp Leu Pro Phe Leu Leu Leu
1 5 10 15

Leu Leu Leu Leu Leu Leu Pro Leu Ala Arg Gly Ala Pro Ala Arg Pro
20 25 30

Ala Ala Gly Gly Gln Ala Ser Glu Leu Val Val Pro Thr Arg Leu Pro
35 40 45

Gly Ser Ala Gly Glu Leu Ala Leu His Leu Ser Ala Phe Gly Lys Gly
50 55 60

Phe Val Leu Arg Leu Ala Pro Asp Asp Ser Phe Leu Ala Pro Glu Phe
65 70 75 80

Lys Ile Glu Arg Leu Gly Gly Ser Gly Arg Ala Thr Gly Gly Glu Arg
85 90 95






Gly Leu Arg Gly Cys Phe Phe Ser Gly Thr Val Asn Gly Glu Pro Glu
100 105 110

Ser Leu Ala Ala Val Ser Leu Cys Arg Gly Leu Ser Gly Ser Phe Leu
115 120 125

Leu Asp Gly Glu Glu Phe Thr Ile Gln Pro Gln Gly Ala Gly Gly Ser
130 135 140

Leu Ala Gln Pro His Arg Leu Gln Arg Trp Gly Pro Ala Gly Ala Arg
145 150 155 160

Pro Leu Pro Arg Gly Pro Glu Trp Glu Val Glu Thr Gly Glu Gly Gln
165 170 175

Arg Gln Glu Arg Gly Asp His Gln Glu Asp Ser Glu Glu Glu Ser Gln
180 185 190

Glu Glu Glu Ala Glu Gly Ala Ser Glu Pro Pro Pro Pro Leu Gly Ala
195 200 205

Thr Ser Arg Thr Lys Arg Phe Val Ser Glu Ala Arg Phe Val Glu Thr
210 215 220

Leu Leu Val Ala Asp Ala Ser Met Ala Ala Phe Tyr Gly Ala Asp Leu
225 230 235 240

Gln Asn His Ile Leu Thr Leu Met Ser Val Ala Ala Arg Ile Tyr Lys
245 250 255

His Pro Ser Ile Lys Asn Ser Ile Asn Leu Met Val Val Lys Val Leu
260 265 270

Ile Val Glu Asp Glu Lys Trp Gly Pro Glu Val Ser Asp Asn Gly Gly
275 280 285

Leu Thr Leu Arg Asn Phe Cys Asn Trp Gln Arg Arg Phe Asn Gln Pro
290 295 300

Ser Asp Arg His Pro Glu His Tyr Asp Thr Ala Ile Leu Leu Thr Arg
305 310 315 320

Gln Asn Phe Cys Gly Gln Glu Gly Leu Cys Asp Thr Leu Gly Val Ala
325 330 335

Asp Ile Gly Thr Ile Cys Asp Pro Asn Lys Ser Cys Ser Val Ile Glu
340 345 350

Asp Glu Gly Leu Gln Ala Ala His Thr Leu Ala His Glu Leu Gly His
355 360 365

Val Leu Ser Met Pro His Asp Asp Ser Lys Pro Cys Thr Arg Leu Phe
370 375 380

Gly Pro Met Gly Lys His His Val Met Ala Pro Leu Phe Val His Leu
385 390 395 400

Asn Gln Thr Leu Pro Trp Ser Pro Cys Ser Ala Met Tyr Leu Thr Glu
405 410 415

Leu Leu Asp Gly Gly His Gly Asp Cys Leu Leu Asp Ala Pro Gly Ala
420 425 430

Ala Leu Pro Leu Pro Thr Gly Leu Pro Gly Arg Met Ala Leu Tyr Gln
435 440 445

Leu Asp Gln Gln Cys Arg Gln Ile Phe Gly Pro Asp Phe Arg His Cys
450 455 460

Pro Asn Thr Ser Ala Gln Asp Val Cys Ala Gln Leu Trp Cys His Thr
465 470 475 480

Asp Gly Ala Glu Pro Leu Cys His Thr Lys Asn Gly Ser Leu Pro Trp
485 490 495

Ala Asp Gly Thr Pro Cys Gly Pro Gly His Leu Cys Ser Glu Gly Ser
500 505 510

Cys Leu Pro Glu Glu Glu Val Glu Arg Pro Lys Pro Val Val Asp Gly
515 520 525

Gly Trp Ala Pro Trp Gly Pro Trp Gly Glu Cys Ser Arg Thr Cys Gly
530 535 540

Gly Gly Val Gln Phe Ser His Arg Glu Cys Lys Asp Pro Glu Pro Gln
545 550 555 560

Asn Gly Gly Arg Tyr Cys Leu Gly Arg Arg Ala Lys Tyr Gln Ser Cys
565 570 575

His Thr Glu Glu Cys Pro Pro Asp Gly Lys Ser Phe Arg Glu Gln Gln
580 585 590

Cys Glu Lys Tyr Asn Ala Tyr Asn Tyr Thr Asp Met Asp Gly Asn Leu
595 600 605

Leu Gln Trp Val Pro Lys Tyr Ala Gly Val Ser Pro Arg Asp Arg Cys
610 615 620

Lys Leu Phe Cys Arg Ala Arg Gly Arg Ser Glu Phe Lys Val Phe Glu
625 630 635 640

Ala Lys Val Ile Asp Gly Thr Leu Cys Gly Pro Glu Thr Leu Ala Ile
645 650 655

Cys Val Arg Gly Gln Cys Val Lys Ala Gly Cys Asp His Val Val Asp
660 665 670

Ser Pro Arg Lys Leu Asp Lys Cys Gly Val Cys Gly Gly Lys Gly Asn
675 680 685

Ser Cys Arg Lys Val Ser Gly Ser Leu Thr Pro Thr Asn Tyr Gly Tyr
690 695 700

Asn Asp Ile Val Thr Ile Pro Ala Gly Ala Thr Asn Ile Asp Val Lys
705 710 715 720

Gln Arg Ser His Pro Gly Val Gln Asn Asp Gly Asn Tyr Leu Ala Leu
725 730 735



Lys Thr Ala Asp Gly Gln Tyr Leu Leu Asn Gly Asn Leu Ala Ile Ser
740 745 750

Ala Ile Glu Gln Asp Ile Leu Val Lys Gly Thr Ile Leu Lys Tyr Ser
755 760 765

Gly Ser Ile Ala Thr Leu Glu Arg Leu Gln Ser Phe Arg Pro Leu Pro
770 775 780

Glu Pro Leu Thr Val Gln Leu Leu Thr Val Pro Gly Glu Val Phe Pro
785 790 795 800

Pro Lys Val Lys Tyr Thr Phe Phe Val Pro Asn Asp Val Asp Phe Ser
805 810 815

Met Gln Ser Ser Lys Glu Arg Ala Thr Thr Asn Ile Ile Gln Pro Leu
820 825 830

Leu His Ala Gln Trp Val Leu Gly Asp Trp Ser Glu Cys Ser Ser Thr
835 840 845

Cys Gly Ala Gly Trp Gln Arg Arg Thr Val Glu Cys Arg Asp Pro Ser
850 855 860

Gly Gln Ala Ser Ala Thr Cys Asn Lys Ala Leu Lys Pro Glu Asp Ala
865 870 875 880

Lys Pro Cys Glu Ser Gln Leu Cys Pro Leu
885 890



<210> 5 
<211> 1203 
<212> PRT 
<213> Bovine 

<400> 5 

Met Asp Pro Pro Ala Gly Ala Ala Gly Arg Leu Leu Cys Pro Ala Leu
1 5 10 15

Leu Leu Leu Leu Leu Leu Pro Leu Pro Ala Asp Ala Arg Leu Ala Ala
20 25 30

Ala Ala Ala Asp Pro Pro Gly Gly Pro Gln Gly His Gly Ala Glu Arg
35 40 45

Ile Leu Ala Val Pro Val Arg Thr Asp Ala Gln Gly Arg Leu Val Ser
50 55 60

His Val Val Ser Ala Ala Thr Ala Pro Ala Gly Val Arg Thr Arg Arg
65 70 75 80

Ala Ala Pro Ala Gln Ile Pro Gly Leu Ser Gly Gly Ser Glu Glu Asp
85 90 95

Pro Gly Gly Arg Leu Phe Tyr Asn Val Thr Val Phe Gly Arg Asp Leu
100 105 110

His Leu Arg Leu Arg Pro Asn Ala Arg Leu Val Ala Pro Gly Ala Thr
115 120 125

Val Glu Trp Gln Gly Glu Ser Gly Ala Thr Arg Val Glu Pro Leu Leu
130 135 140

Gly Thr Cys Leu Tyr Val Gly Asp Val Ala Gly Leu Ala Glu Ser Ser
145 150 155 160

Ser Val Ala Leu Ser Asn Cys Asp Gly Leu Ala Gly Leu Ile Arg Met
165 170 175

Glu Glu Glu Glu Phe Phe Ile Glu Pro Leu Glu Lys Gly Leu Ala Ala
180 185 190

Lys Glu Ala Glu Gln Gly Arg Val His Val Val Tyr His Arg Pro Thr
195 200 205

Thr Ser Arg Pro Pro Pro Leu Gly Gln Ala Leu Asp Thr Gly Ile Ser
210 215 220

Ala Asp Ser Leu Asp Ser Leu Ser Arg Ala Leu Gly Val Leu Glu Glu
225 230 235 240

Arg Val Asn Ser Ser Arg Arg Arg Met Arg Arg His Ala Ala Asp Asp
245 250 255

Asp Tyr Asn Ile Glu Val Leu Leu Gly Val Asp Asp Ser Val Val Gln
260 265 270

Phe His Gly Thr Glu His Val Gln Lys Tyr Leu Leu Thr Leu Met Asn
275 280 285

Ile Val Asn Glu Ile Tyr His Asp Glu Ser Leu Gly Ala His Ile Asn
290 295 300

Val Val Leu Val Arg Ile Ile Leu Leu Ser Tyr Gly Lys Ser Met Ser
305 310 315 320

Leu Ile Glu Ile Gly Asn Pro Ser Gln Ser Leu Glu Asn Val Cys Arg
325 330 335

Trp Ala Tyr Leu Gln Gln Lys Pro Asp Thr Asp His Asp Glu Tyr His
340 345 350

Asp His Ala Ile Phe Leu Thr Arg Gln Asp Phe Gly Pro Ser Gly Met
355 360 365

Gln Gly Tyr Ala Pro Val Thr Gly Met Cys His Pro Val Arg Ser Cys
370 375 380

Thr Leu Asn His Glu Asp Gly Phe Ser Ser Ala Phe Val Val Ala His
385 390 395 400

Glu Thr Gly His Val Leu Gly Met Glu His Asp Gly Gln Gly Asn Arg
405 410 415

Cys Gly Asp Glu Val Arg Leu Gly Ser Ile Met Ala Pro Leu Val Gln
420 425 430

Ala Ala Phe His Arg Phe His Trp Ser Arg Cys Ser Gln Gln Glu Leu
435 440 445

Ser Arg Tyr Leu His Ser Tyr Asp Cys Leu Arg Asp Asp Pro Phe Thr
450 455 460

His Asp Trp Pro Ala Leu Pro Gln Leu Pro Gly Leu His Tyr Ser Met
465 470 475 480

Asn Glu Gln Cys Arg Phe Asp Phe Gly Leu Gly Tyr Met Met Cys Thr
485 490 495

Ala Phe Arg Thr Phe Asp Pro Cys Lys Gln Leu Trp Cys Ser His Pro
500 505 510

Asp Asn Pro Tyr Phe Cys Lys Thr Lys Lys Gly Pro Pro Leu Asp Gly
515 520 525

Thr Met Cys Ala Pro Gly Lys His Cys Phe Lys Gly His Cys Ile Trp
530 535 540

Leu Thr Pro Asp Ile Leu Lys Arg Asp Gly Asn Trp Gly Ala Trp Ser
545 550 555 560

Pro Phe Gly Ser Cys Ser Arg Thr Cys Gly Thr Gly Val Lys Phe Arg
565 570 575

Thr Arg Gln Cys Asp Asn Pro His Pro Ala Asn Gly Gly Arg Thr Cys
580 585 590

Ser Gly Leu Ala Tyr Asp Phe Gln Leu Cys Asn Ser Gln Asp Cys Pro
595 600 605

Asp Ala Leu Ala Asp Phe Arg Glu Glu Gln Cys Arg Gln Trp Asp Leu
610 615 620

Tyr Phe Glu His Gly Asp Ala Gln His His Trp Leu Pro His Glu His
625 630 635 640

Arg Asp Ala Lys Glu Arg Cys His Leu Tyr Cys Glu Ser Lys Glu Thr
645 650 655

Gly Glu Val Val Ser Met Lys Arg Met Val His Asp Gly Thr Arg Cys
660 665 670

Ser Tyr Lys Asp Ala Phe Ser Leu Cys Val Arg Gly Asp Cys Arg Lys
675 680 685

Val Gly Cys Asp Gly Val Ile Gly Ser Ser Lys Gln Glu Asp Lys Cys
690 695 700

Gly Val Cys Gly Gly Asp Asn Ser His Cys Lys Val Val Lys Gly Thr
705 710 715 720

Phe Ser Arg Ser Pro Lys Lys Leu Gly Tyr Ile Lys Met Phe Glu Ile
725 730 735

Pro Ala Gly Ala Arg His Leu Leu Ile Gln Glu Ala Asp Thr Thr Ser
740 745 750

His His Leu Ala Val Lys Asn Leu Glu Thr Gly Lys Phe Ile Leu Asn
755 760 765

Glu Glu Asn Asp Val Asp Pro Asn Ser Lys Thr Phe Ile Ala Met Gly
770 775 780

Val Glu Trp Glu Tyr Arg Asp Glu Asp Gly Arg Glu Thr Leu Gln Thr
785 790 795 800

Met Gly Pro Leu His Gly Thr Ile Thr Val Leu Val Ile Pro Glu Gly
805 810 815

Asp Ala Arg Ile Ser Leu Thr Tyr Lys Tyr Met Ile His Glu Asp Ser
820 825 830

Leu Asn Val Asp Asp Asn Asn Val Leu Glu Asp Asp Ser Val Gly Tyr
835 840 845

Glu Trp Ala Leu Lys Lys Trp Ser Pro Cys Ser Lys Pro Cys Gly Gly
850 855 860

Gly Ser Gln Phe Thr Lys Tyr Gly Cys Arg Arg Arg Leu Asp His Lys
865 870 875 880

Met Val His Arg Gly Phe Cys Asp Ser Val Ser Lys Pro Lys Ala Ile
885 890 895

Arg Arg Thr Cys Asn Pro Gln Glu Cys Ser Gln Pro Val Trp Val Thr
900 905 910

Gly Glu Trp Glu Pro Cys Ser Arg Ser Cys Gly Arg Thr Gly Met Gln
915 920 925

Val Arg Ser Val Arg Cys Val Gln Pro Leu His Asn Asn Thr Thr Arg
930 935 940

Ser Val His Thr Lys His Cys Asn Asp Ala Arg Pro Glu Gly Arg Arg
945 950 955 960

Ala Cys Asn Arg Glu Leu Cys Pro Gly Arg Trp Arg Ala Gly Ser Trp
965 970 975

Ser Gln Cys Ser Val Thr Cys Gly Asn Gly Thr Gln Glu Arg Pro Val
980 985 990

Leu Cys Arg Thr Ala Asp Asp Ser Phe Gly Val Cys Arg Glu Glu Arg
995 1000 1005

Pro Glu Thr Ala Arg Ile Cys Arg Leu Gly Pro Cys Pro Arg Asn Thr
1010 1015 1020

Ser Asp Pro Ser Lys Lys Ser Tyr Val Val Gln Trp Leu Ser Arg Pro
1025 1030 1035 1040

Asp Pro Asn Ser Pro Val Gln Glu Thr Ser Ser Lys Gly Arg Cys Gln
1045 1050 1055

Gly Asp Lys Ser Val Phe Cys Arg Met Glu Val Leu Ser Arg Tyr Cys
1060 1065 1070

Ser Ile Pro Gly Tyr Asn Lys Leu Cys Cys Lys Ser Cys Asn Pro His
1075 1080 1085

Asp Asn Leu Thr Asp Val Asp Asp Arg Ala Glu Pro Pro Ser Gly Lys
1090 1095 1100

His Asn Asp Ile Glu Glu Leu Met Pro Thr Leu Ser Val Pro Thr Leu
1105 1110 1115 1120

Val Met Glu Val Gln Pro Pro Pro Gly Ile Pro Leu Glu Val Pro Leu
1125 1130 1135

Asn Thr Ser Ser Thr Asn Ala Thr Glu Asp His Pro Glu Thr Asn Ala
1140 1145 1150

Val Asp Val Pro Tyr Lys Ile Pro Gly Leu Glu Asp Glu Val Gln Pro
1155 1160 1165

Pro Asn Leu Ile Pro Arg Arg Pro Ser Pro Tyr Glu Lys Thr Arg Asn
1170 1175 1180

Gln Arg Ile Gln Glu Leu Ile Asp Glu Met Arg Lys Lys Glu Met Leu
1185 1190 1195 1200

Gly Lys Phe



<210> 6 
<211> 50 
<212> PRT 

<213> Homo sapiens 
<400> 6 

Asp Asp Gly Trp Ser Pro Trp Ser Glu Trp Thr Ser Cys Ser Thr Ser
1 5 10 15

Cys Gly Asn Gly Ile Gln Gln Arg Gly Arg Ser Cys Asp Ser Leu Asn
20 25 30

Asn Arg Cys Glu Gly Ser Ser Val Gln Thr Arg Thr Cys His Ile Gln
35 40 45

Glu Cys
50



<210> 7 
<211> 57 
<212> PRT 

<213> Homo sapiens 
<400> 7

Asp Gly Gly Trp Ser His Trp Ser Pro Trp Ser Ser Cys Ser Val Thr
1 5 10 15

Cys Gly Asp Gly Val Ile Thr Arg Ile Arg Leu Cys Asn Ser Pro Ser
20 25 30

Pro Gln Met Asn Gly Lys Pro Cys Glu Gly Glu Ala Arg Glu Thr Lys
35 40 45

Ala Cys Lys Lys Asp Ala Cys Pro Ile
50 55

<210> 8 
<211> 57 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Asn Gly Gly Trp Gly Pro Trp Ser Pro Trp Asp Ile Cys Ser Val Thr
1 5 10 15

Cys Gly Gly Gly Val Gln Lys Arg Ser Arg Leu Cys Asn Asn Pro Thr
20 25 30

Pro Gln Phe Gly Gly Lys Asp Cys Val Gly Asp Val Thr Glu Asn Gln
35 40 45

Ile Cys Asn Lys Gln Asp Cys Pro Ile
50 55

<210> 9 

<211> 50 

<212> PRT 

<213> Homo sapiens 



<400> 9 

Glu Glu Gly Trp Ser Pro Trp Ala Glu Trp Thr Gln Cys Ser Val Thr
1 5 10 15

Cys Gly Ser Gly Thr Gln Gln Arg Gly Arg Ser Cys Asp Val Thr Ser
20 25 30

Asn Thr Cys Leu Gly Pro Ser Ile Gln Thr Arg Ala Cys Ser Leu Ser
35 40 45

Lys Cys
50

<210> 10 
<211> 57 
<212> PRT 

<213> Homo sapiens 
<400> 10 

Asp Gly Gly Trp Ser His Trp Ser Pro Trp Ser Ser Cys Ser Val Thr
1 5 10 15

Cys Gly Val Gly Asn Ile Thr Arg Ile Arg Leu Cys Asn Ser Pro Val
20 25 30

Pro Gln Met Gly Gly Lys Asn Cys Lys Gly Ser Gly Arg Glu Thr Lys
35 40 45

Ala Cys Gln Gly Ala Pro Cys Pro Ile
50 55

<210> 11 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<400> 11 

Asp Gly Arg Trp Ser Pro Trp Ser Pro Trp Ser Ala Cys Thr Val Thr
1 5 10 15

Cys Ala Gly Gly Ile Arg Glu Arg Thr Arg Val Cys Asn Ser Pro Glu
20 25 30

Pro Gln Tyr Gly Gly Lys Ala Cys Val Gly Asp Val Gln Glu Arg Gln
35 40 45

Met Cys Asn Lys Arg Ser Cys Pro
50 55



<210> 12 

<211> 3974 

<212> DNA 

<213> Homo sapiens 

<400> 12 

ggtacctaag tgagtagggc gtccgatcga cggacgcctt ttttttgaat tcgtaatcat 60 
ggtcatagct gtttcctgtg tgaaattgtt atccgctcac aattccacac aacatacgag 120 
ccggaagcat aaagtgtaaa gcctggggtg cctaatgagt gagctaactc acattaattg 180 
cgttgcgctc actgcccgct ttccagtcgg gaaacctgtc gtgccagctg cattaatgaa 240 
tcggccaacg cgcggggaga ggcggtttgc gtattgggcg ctcttccgct tcctcgctca 300 
ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg 360 
taatacggtt atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc 420 
agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc gtttttccat aggctccgcc 480 
cccctgacga gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac 540 
tataaagata ccaggcgttt ccccctggaa gctccctcgt gcgctctcct gttccgaccc 600 
tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcata 660 
gctcacgctg taggtatctc agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc 720 
acgaaccccc cgttcagccc gaccgctgcg ccttatccgg taactatcgt cttgagtcca 780 
acccggtaag acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag 840
cgaggtatgt aggcggtgct acagagttct tgaagtggtg gcctaactac ggctacacta 900
gaagaacagt atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg 960 
gtagctcttg atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc 1020 
agcagattac gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt tctacggggt 1080 
ctgacgctca gtggaacgaa aactcacgtt aagggatttt ggtcatgaga ttatcgtcga 1140
caattcgcgc gcgaaggcga agcggcatgc atttacgttg acaccatcga atggtgcaaa 1200 
acctttcgcg gtatggcatg atagcgcccg gaagagagtc aattcagggt ggtgaatgtg 1260 
aaaccagtaa cgttatacga tgtcgcagag tatgccggtg tctcttatca gaccgtttcc 1320 
cgcgtggtga accaggccag ccacgtttct gcgaaaacgc gggaaaaagt ggaagcggcg 1380
atggcggagc tgaattacat tcccaaccgc gtggcacaac aactggcggg caaacagtcg 1440
ttgctgattg gcgttgccac ctccagtctg gccctgcacg cgccgtcgca aattgtcgcg 1500 
gcgattaaat ctcgcgccga tcaactgggt gccagcgtgg tggtgtcgat ggtagaacga 1560 
agcggcgtcg aagcctgtaa agcggcggtg cacaatcttc tcgcgcaacg cgtcagtggg 1620 
ctgatcatta actatccgct ggargaccag gatgccattg ctgtggaagc tgcctgcact 1680
aatgttccgg cgttatttct tgatgtctct gaccagacac ccatcaacag tattattttc 1740
tcccatgaag acggtacgcg actgggcgtg gagcatctgg tcgcattggg tcaccagcaa 1800 
atcgcgctgt tagcgggccc attaagttct gtctcggcgc gtctgcgtct ggctggctgg 1860 
cataaatatc tcactcgcaa tcaaattcag ccgatagcgg aacgggaagg cgactggagt 1920 
gccatgtccg gttttcaaca aaccatgcaa atgctgaatg agggcatcgt tcccactgcg 1980 
atgctggttg ccaacgatca gatggcgctg ggcgcaatgc gcgccattac cgagtccggg 2040






ctgcgcgttg gtgcggatat ctcggtagtg ggatacgacg ataccgaaga cagctcatgt 2100 
tatatcccgc cgttaaccac catcaaacag gattttcgcc tgctggggca aaccagcgtg 2160 
gaccgcttgc tgcaactctc tcagggccag gcggtgaagg gcaatcagct gttgcccgtc 2220 
tcactggtga aaagaaaaac caccctggcg cccaatacgc aaaccgcctc tccccgcgcg 2280 
ttggccgatt cattaatgca gctggcacga caggtttccc gactggaaag cgggcagtga 2340 
gcgcaacgca attaatgtaa gttagcgcga attgtcgacc aaagcggcca tcgtgcctcc 2400 
ccactcctgc agttcggggg catggatgcg cggatagccg ctgctggttt cctggatgcc 2460
gacggatttg cactgccggt agaactccgc gaggtcgtcc agcctcaggc agcagctgaa 2520 
ccaactcgcg aggggatcga gcccggggtg ggcgaagaac tccagcatga gatccccgcg 2580 
ctggaggatc atccagccgg cgtcccggaa aacgattccg aagcccaacc tttcatagaa 2640
ggcggcggtg gaatcgaaat ctcgtgatgg caggttgggc gtcgcttggt cggtcatttc 2700 
gaaccccaga gtcccgctca gaagaactcg tcaagaaggc gatagaaggc gatgcgctgc 2760 
gaatcgggag cggcgatacc gtaaagcacg aggaagcggt cagcccattc gccgccaagc 2820 
tcttcagcaa tatcacgggt agccaacgct atgtcctgat agcggtccgc cacacccagc 2880 
cggccacagt cgatgaatcc agaaaagcgg ccattttcca ccatgatatt cggcaagcag 2940 
gcatcgccat gggtcacgac gagatcctcg ccgtcgggca tgcgcgcctt gagcctggcg 3000 
aacagttcgg ctggcgcgag cccctgatgc tcttcgtcca gatcatcctg atcgacaaga 3060 
ccggcttcca tccgagtacg tgctcgctcg atgcgatgtt tcgcttggtg gtcgaatggg 3120 
caggtagccg gatcaagcgt atgcagccgc cgcattgcat cagccatgat ggatactttc 3180 
tcggcaggag caaggtgaga tgacaggaga tcctgccccg gcacttcgcc caatagcagc 3240 
cagtcccttc ccgcttcagt gacaacgtcg agcacagctg cgcaaggaac gcccgtcgtg 3300 
gccagccacg atagccgcgc tgcctcgtcc tgcagttcat tcagggcacc ggacaggtcg 3360
gtcttgacaa aaagaaccgg gcgcccctgc gctgacagcc ggaacacggc ggcatcagag 3420 
cagccgattg tctgttgtgc ccagtcatag ccgaatagcc tctccaccca agcggccgga 3480 
gaacctgcgt gcaatccatc ttgttcaatc atgcgaaacg atcctcatcc tgtctcttga 3540 
tcagatcttg atcccctgcg ccatcagatc cttggcggca agaaagccat ccagtttact 3600 
ttgcagggct tcccaacctt accagagggc gccccagctg gcaattccgg ttcgcttgct 3660 
gtccataaaa ccgcccagtc tagctatcgc catgtaagcc cactgcaagc tacctgcttt 3720 
ctctttgcgc ttgcgttttc ccttgtccag atagcccagt agctgacatt catccggggt 3780 
cagcaccgtt tctgcggact ggctttctac gtgttccgct tcctttagca gcccttgcgc 3840 
cctgagtgct tgcggcagcg tgaagcttaa aaaactgcaa aaaatagttt gacttgtgag 3900 
cggataacaa ttaagatgta cccaattgtg agcggataac aatttcacac attaaagagg 3960 
agaaattaca tatg 3974 



<210> 13 

<211> 112 

<212> DNA 

<213> Homo sapiens 

<400> 13 

aagcttaaaa aactgcaaaa aatagtttga cttgtgagcg gataacaatt aagatgtacc 60 
caattgtgag cggataacaa tttcacacat taaagaggag aaattacata tg 112 



<210> 14 

<211> 542 

<212> DNA 

<213> Mus musculus 



<220> 

<221> UNSURE 
<222> (3) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (21) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (22) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (361) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (369) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (407) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (427) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (479) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (482) 

<223> May be any nucleic acid 
<220>

<221> UNSURE 
<222> (535) 

<223> May be any nucleic acid 
<400> 14 

gtncgaattt cggcacgaga nnttagacgc cttttcatgg aagctgggga atgtgggggc 60 
cttggggaga ctgttcgaga acgtgcggtg gaggagtcca gtacacgatg agggaatgtg 120 
acaacccagt cccaaagaat ggagggaagt actgtgaagg caaacgagtg cgctacagat 180 
cctgtaacct tgaggactgt ccagacaata atggaaaaac ctttagagag gaacaatgtg 240
aagcacacaa cgagttttca aaagcttcct ttgggagtgg gcctgcggtg gaatggattc 300 
ccaagtacgc tggcgtctca ccaaaggaca ggtgcaagtt catgttgcca agccaaaggc 360 
nttggctant tctttcgttt tgcagcccaa ggttgttagg tgggtantcc atgttaggcc 420 
cagattncac ctttgtctgt gtgcaaggac agtgtgttaa aagttggttg tgatccgcnt 480 
cntagattcc aaaaggagtt ttgttaatgt ggtgttttcn gggggaatgg tctantttta 540 
aa 542 

<210> 15 
<211> 320 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 15 

cagagaacat tcgccccact cttcaatgac ccatgctgaa aaagtgggga tagcattgaa 60 
agattccttc ttcttcttta cgaagtaggt gtatttaatt ttaggtcgaa gggcattgcc 120 
cacagtaaga acctggatgg tcaagggctc tttgagaggg ctaaagctgc gaattctttc 180 
caatgccgca gaggagccgc tgtacctcaa gacaacacct ttgtacataa tgtcttgctc 240
taaggtggac aaagtgtagt caccattaag aatatatgtg ccatcagcag ctttgatggc 300 
aagaaagctg cccttgttcc 320 

<210> 16 
<211> 316 
<212> DNA 

<213> Eimeria tenella 
<400> 16 

aatgccgaga cattaatgga cagcctgctt ccgagtgtgc aaaggaagtg aagccagcca 60 
gcaccagacc ttgtgcagac catccctgcc cccagtggca gctgggggag tggtcatcat 120 
gttctaagac ctgtgggaag ggttacaaaa aaagaagctt gaagtgtctg tcccatgatg 180 
gaggggtgtt atctcatgag agctgtgatc ctttaaagaa acctaaacat ttcatagact 240 
tttgcacaat ggcagaatgc agttaagtgg tttaagtggt gttagctttg agggcaaggc 300 
aaagtgagga agggct 316 



<210> 17 
<211> 383 
<212> DNA 

<213> Caenorhabditis elegans 
<220> 

<221> UNSURE 
<222> (160) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (326) 

<223> May be any nucleic acid 
<220>

<221> UNSURE
<222> (358)

<223> May be any nucleic acid
<220>

<221> UNSURE
<222> (366)

<223> May be any nucleic acid
<220>

<221> UNSURE
<222> (377)

<223> May be any nucleic acid
<220>

<221> UNSURE
<222> (379)

<223> May be any nucleic acid
<400> 17

gtcgacccac gcgtccggat ggtactccat gtagcccaga ttccacctct gtctgtgtgc 60 
aaggacagtg tgtaaaagct ggttgtgatc gcatcataga ctccaaaaag aagtttgata 120 
aatgtggtgt ttgcggggga aatggatcta cttgtaaaan aatatcagga tcagttacta 180 
gtgcaaaacc tgggatatca tgatatcatc acaattccaa ctgggagcca ccaacatcga 240
agtgaaacag cggaaccaga ggggatccag ggaacaatgg gcagctttct tgccatcaaa 300 
gctgctggat ggcacatata ttcttnaatg gtgactacac tttgtccacc ttagaganag 360 
acattntgtg acaaagngnt tgt 383 



<210> 18 
<211> 404 
<212> DNA 

<213> Crotalus atrox 



<220> 
<221> UNSURE
<222> (21) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (301) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (335) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (373) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (378) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (382) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (383) 

<223> May be any nucleic acid 
<400> 18 

cccacgcgtc cgcccacggt nccgggactt gtgtgggtcc cagacatgtg atactcttgg 60 
gatggctgat gttggaactg tgtgtgatcc gagcagaagc tgctccgtca tagaagatga 120 
tggtttacaa gctgccttca ccacagccca tgaattaggc cacgtgttta acatgccaca 180 
tgatggatgc aaagcagtgt gccagcctta aatggtgtga accagggatt cccacatgat 240 



ggcgtcaatg ctttccaacc tgggaccaca gccagccttg ggtcctcctt gcagtggcct 300 
nacatggatt gacatcattt ctgggatgaa tggtncatgg gggaatgttt tgattggaca 360
agccttcaga atnccctnac annttcccag gggttctccc tggg 404



<210> 19 

<211> 152 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> (105) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (122) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (135) 

<223> May be any nucleic acid 
<400> 19 

atcgtagaag atgaaaaatg gggcccagag gtgtccgaca atggggggct tacactgcgt 60 
aacttctgca actggcagcg gcgtttcaac cagcccagcg accgncaccc agagcactac 120 
gncacggcca tcctnctcac cagacagaac tt 152 



<210> 20 
<211> 4180 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 20 
gcagctccga gctaggtgct atcgcaaggc cagagcgcac agcccggcgg agagagcaga 60
tccttgctca gatcgagtca aatcgggcca aggcggagga cgaagagtcc aggctcctat 120
tctggacttg ttccccagct ccgggggcgc ttctaggtcc tgcagcagcc agcagtgcgg 180 
agccaccaac tcggtgctgg aatgaaaaaa ttcccgcgcg ccagtgcaga atctttctaa 240 
gtgacccgga gcttcgggtg ctagctctgc acgaactttc ccatcaaagt gatcgtgaat 300 
tttaagcatc aggagcaggc cagcgaagct ctacgcgtct aaacgtctat ccagaccaag 360 
agttctctgc ggtgcagggt gcggtgccat gcagccaaaa gtcccttttg gggtcacgca 420 
agcagaagcc ctgctccgac atgggggacg tccagcgggc agcgagatct cggggctctc 480 
tgtccgcaca catgctgttg ctgctcctcg cttccataac aatgctgcta tgtgcgcggg 540 
gcgcacacgg gcgccccacg gaggaagatg aggagctggt cctgccctcg ctggagcgcg 600 
ccccgggcca cgattccacc accacacgcc ttcgtctgga cgcctttggc cagcagctac 660 
atctgaagtt gcagccggac agcggtttct tggcgcctgg cttcaccctg cagactgtgg 720 
ggcgcagtcc cgggtccgag gcacaacatc tggaccccac cggggacctg gctcactgct 780 
tctactctgg cacggtgaac ggtgatcccg gctctgccgc agccctcagc ctctgtgaag 840
gtgtgcgtgg tgccttctac ctacaaggag aggagttctt cattcagcca gcgcctggag 900 
tggccaccga gcgcctggcc cctgccgtgc ccgaggagga gtcatccgca cggccgcagt 960 
tccacatcct gaggcgaagg cggcggggca gtggcggcgc caagtgcggc gtcatggacg 1020 
acgagaccct gccaaccagc gactcgcgac ccgagagcca gaacacccgg aaccagtggc 1080 
ctgtgcggga ccccacgcct caggacgcgg gaaagccatc aggaccagga agcataagga 1140 
agaagcgatt tgtgtccagc ccccgttatg tggaaaccat gctcgtagct gaccagtcca 1200 
tggccgactt ccacggcagc ggtctaaagc attaccttct aaccctgttc tcggtggcag 1260 






ccaggtttta caagcatccc agcattagga attcaattag cctggtggtg gtgaagatct 1320 
tggtcatata cgaggagcag aagggaccag aagttacctc caatgcagct ctcacccttc 1380 
ggaatttctg cagctggcag aaacaacaca acagccccag tgaccgggat ccagagcact 1440 
atgacactgc aattctgttc accagacagg atttatgtgg ctcccacacg tgtgacactc 1500 
tcggaatggc agatgttgga accgtatgtg accccagcag gagctgctca gtcatagaag 1560 
atgatggttt gcaagccgcc ttcaccacag cccatgaatt gggccatgtg tttaacatgc 1620 
cgcacgatga tgctaagcac tgtgccagct tgaatggtgt gagtggcgat tctcatctga 1680
tggcctcgat gctctccagc ttagaccata gccagccctg gtcaccttgc agtgcctaca 1740 
tggtcacgtc cttcctagat aatggacacg gggaatgttt gatggacaag ccccagaatc 1800 
caatcaagct cccttctgat cttcccggta ccttgtacga tgccaaccgc cagtgtcagt 1860
ttacattcgg agaggaatcc aagcactgcc ctgatgcagc cagcacatgt actaccctgt 1920 
ggtgcactgg cacctccggt ggcttactgg tgtgccaaac aaaacacttc ccttgggcag 1980 
atggcaccag ctgtggagaa gggaagtggt gtgtcagtgg caagtgcgtg aacaagacag 2040 
acatgaagca ttttgctact cctgttcatg gaagctgggg accatgggga ccgtggggag 2100 
actgctcaag aacctgtggt ggtggagttc aatacacaat gagagaatgt gacaacccag 2160 
tcccaaagaa cggagggaag tactgtgaag gcaaacgagt ccgctacagg tcctgtaaca 2220 
tcgaggactg tccagacaat aacggaaaaa cgttcagaga ggagcagtgc gaggcgcaca 2280 
atgagttttc caaagcttcc tttgggaatg agcccactgt agagtggaca cccaagtacg 2340 
ccggcgtctc gccaaaggac aggtgcaagc tcacctgtga agccaaaggc attggctact 2400 
ttttcgtctt acagcccaag gttgtagatg gcactccctg tagtccagac tctacctctg 2460 



tctgtgtgca agggcagtgt gtgaaagctg gctgtgatcg catcatagac tccaaaaaga 2520
agtttgataa gtgtggcgtt tgtggaggaa acggttccac atgcaagaag atgtcaggaa 2580
tagtcactag tacaagacct gggtatcatg acattgtcac aattcctgct ggagccacca 2640
acattgaagt gaaacatcgg aatcaaaggg ggtccagaaa caatggcagc tttctggcta 2700
ttagagccgc tgatggtacc tatattctga atggaaactt cactctgtcc acactagagc 2760
aagacctcac ctacaaaggt actgtcttaa ggtacagtgg ttcctcggct gcgctggaaa 2820
gaatccgcag ctttagtcca ctcaaagaac ccttaaccat ccaggttctt atggtaggcc 2880
atgctctccg acccaaaatt aaattcacct actttatgaa gaagaagaca gagtcattca 2940
acgccattcc cacattttct gagtgggtga ttgaagagtg gggggagtgc tccaagacat 3000
gcggctcagg ttggcagaga agagtagtgc agtgcagaga cattaacgga caccctgctt 3060
ccgaatgtgc aaaggaagtg aagccagcca gtaccagacc ttgtgcagac cttccttgcc 3120
cacactggca ggtgggggat tggtcaccat gttccaaaac ttgcgggaag ggttacaaga 3180
agagaacctt gaaatgtgtg tcccacgatg ggggcgtgtt atcaaatgag agctgrgatc 3240
ctttgaagaa gccaaagcat tacattgact tttgcacact gacacagtgc agttaagagg 3300
cgttagagga caaggtagcg tggggagggg ctgatacact gagtgcaaga gtactggagg 3360
gatccagtga gtcaaaccag taagcagtga ggtgtggcaa ggaggtgtgt gtaggggata 3420
catagcaaag gaggtagatc aggacactac cctgccagtt acattctgat aaggtagtta 3480
atgaggcaca gtagcatctg aaagaccata cagagcacta aggagcccca aagcactatt 3540
agtatctctt ttcttatatc tatcgcccaa ataattttca gagtctggca gaagccctgt 3600
tgcactgtac taactagata cttcttatca caaagattgg gaaaggcaaa gcagaaagat 3660
ggtaagactg ggtttcaaac aaggcttggt ttcaatcact ggaggcaagg aggaggggac 3720
aaacaagatc attattcgaa gtcgctggtt gctgtggttt tacggaaggt tgatgcatca 3780
ttcctatcaa cagtgaaaag ttcagcttgt tcaacgtgac agaaaggctc atctccgtga 3840 
aagagctcct gatttcttct tacaccatct cagttcttaa ctatagttca tgttgaggta 3900 
gaaacaattc atctatttat aaaatgtaca ttggaaaaaa aaagtgaagt ttatgaggta 3960 
cacataaaaa ctgaaggaaa caatgagcaa catgcctcct gctttgcttc ctcctgaggt 4020 
aaacctgcct ggggattgag gttgtttaag attatccatg gctcacaaga ggcagtaaaa 4080
taatacatgt tgtgccagag ttagaatggg gtatagagat cagggtccca tgagatgggg 4140 
aacatggtga tcactcatct cacatgggag gctgctgcag 4180 

<210> 21 
<211> 9248 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 21 

gcagctccga gctaggtgct atcgcaaggc cagagcgcac agcccggcgg agagagcaga 60 
tccttgctca gatcgagtca aatcggggcc aaggcggagg acgaagagtc caggctccta 120 
ttctggactt gttccccagc tccgggggcg cttctaggtc ctgcagcagc caggagtgcg 180 
gagccaccaa ctcggtgctg gaatgaaaaa attcccgcgc gccagtgcag aatctttcta 240 
agtgacccgg agcttcgggt gctagctctg cacgaacttt cccatcaaag tgatcgtgaa 300
ttttaagcat caggagcagg ccagcgaagc tctacgcgtc taaacgtcta tccagaccaa 360 
gagttctctg cggtgcaggg tgcggtgcca tgcagccaaa agtccctttg gggtcacgca 420 
agcagaagcc ctgctccgac atgggggacg tccagcgggc agcgagatct cggggctctc 480
tgtccgcaca catgctgttg ctgctcctcg cttccataac aatgctgcta tgtgcgcggg 540 
gcgcacacgg gcgccccacg gaggaagatg aggagctggt cctgccctcg ctggagcgcg 600 
ccccgggcca cgattccacc accacacgcc ttcgtctgga cgcctttggc cagcagctac 660 
atctgaagtt gcagccggac agcggtttct tggcgcctgg cttcaccctg cagactgtgg 720 
ggcgcagtcc cgggtccgag gcacaacatc tggaccccac cggggacctg gctcactgct 780 
tctactctgg cacggtgaac ggtgatcccg gctctgccgc agccctcagc ctctgtgaag 840 
gtgtgcgtgg tgccttctac ctacaaggag aggagttctt cattcagcca gcgcctggag 900 
tggccaccga gcgcctggcc cctgccgtgc ccgaggagga gtcatccgca cggccgcagt 960 
tccacatcct gaggcgaagg cggcggggca gtggcggcgc caagtgcggc gtcatggacg 1020 
acgagaccct gccaaccagc gactcgcgac ccgagagcca gaacacccgg aaccagtggc 1080 
ctgtgcggga ccccacgcct caggacgcgg gaaagccatc aggtataaga gtgaccccca 1140
tctctcagtc tttacgaggc gtgacttggg gtcacactcc agatcgcctc taaatgcgaa 1200 
tgactcagac ttgcagtgaa ttgaagttct gggtcgtgac cttcccgctc cccccccccc 1260 
aaaaaaagtg tgaccatact ctgctagaac acttatttgc ccgaatagtt aataatttga 1320 
gaaagagaga aagaatcgga ggtcctgtag ataagggcta agcgtttcct ccgcgaagcc 1380 
aataacccga ctccttacac tggagaatct ctctccatcc ctttaatgcc tttagtgaat 1440 
gtatgagttc actttaacta ggttgtagtt tcgcgctgag ttttgtaacg tcagtccgtg 1500 
tgagcacgta gcgctcaaag gagggcggag tagaggagcc atggtgaccc ggatgtgcgt 1560
tcaggagcct gggcaacggc agtggtgatc tcatttctgt ggccttccgt ctgtcccctt 1620 
cccccatttg aaaagctgac cccgatggct ggtggctccg ttgggcccct ctgcagaacc 1680 
tgcttgggag gtctttgctt ggttcgcccc gcctccacgc gcctcctacc tcggcctcgt 1740
tgctcgcact ccctctcccg gcagaggttg gactccccag cgctgtggaa tgttagcctg 1800 
gactgatcct ccctgctaca cattcgcctg actctgccgt gttcagtctc taccagccag 1860 
ttagttcttt ttaatcattc aaatttcttt ttgccctttt ctagatttct ccctcttttc 1920 
cgacttgtcc ctaggagctg gtattcatat cctactttac gatttctctg accgctgagt 1980 
ctcagcagcc cgaaaaaggc cattttccaa attggcaacc ctggtttgag aaaggaactt 2040 
attccccccg gggcactggg agtgagagga ggcaggaaaa cactgctggg cagagtgggt 2100 
ggtcctagtg cccggaactg gatcaagcag agaaccccct gggacccctt gaatgagaga 2160 
gctgagcctt acagactgag actcctcaag ccccacccct tggctgagct ccccgccctg 2220 
ccccatgcct tccacgtgga gctggatgat ctcattcggg atttcagccc tggcttcaat 2280 
agtgaaaggg tgactcaggg cgtccgcctg cttctcttgc caagttttta ctacagctgg 2340 
gtagaaatga tagccatact gcctcactca ggctgtggag tcttcaaaga ccacaaaaga 2400 
aatctgcgga cacatatata gacagtttga tcactctgtt gcttgctttg ttttgttttg 2460 
ttttgtctta tttaaagcaa aagaaaaaag acttaaaaat aactcacagt ttttagaaga 2520 
tgcaaatatt tgttttattt ttgttccagg tgtatttcag ttttatttac tttgactagg 2580 
ttgactttcc taatataccc cgagaaggtc actattagga gaaggactgc ccatgagcaa 2640 
acttcctttt ctttttacag gaccaggaag cataaggaag aagcgatttg tgtccagccc 2700 
ccgttatgtg gaaaccatgc tcgtggctga ccagtccatg gccgacttcc acggcagcgg 2760 
tctaaagcat taccttctaa ccctgttctc ggtggcagcc aggttttaca agcatcccag 2820 
cattaggaat tcaattagcc tggtggtggt gaagatcttg gtcatatatg aggagcagaa 2880 



gggaccagaa gttacctcca atgcagctct cacccttcgg aatttctgca actggcagaa 2940
acaacacaac agccccagtg accgggatcc agagcactat gacactgcaa ttctgttcac 3000
cagacaggta agacaggagc ttatcaacca tttcatcaac tcaactcgga ggtcagcctt 3060
gtgttggatg ggatgagagg gtgggggtgt ggcggagagg aaacccagaa ggggatgaca 3120
tttgaaatgt aaacaaaata accaattaaa aaaaaaaggc atctcatctg tattgcctca 3180
tttcctttcg gttataggct agctcaatct gtcttgctta tttctatttt aaacttccac 3240
atctcaagtt ctacagttct attttaaaag cattacaggg aatcttgctt agagtcagtc 3300
cttcaagccc agcaataatg aatggacagg cttcaaagtg catgtgaaga cacgcccaac 3360
tgaagagcta agtatcactc tctcctactt aaaagggatt tcccttgcct ctttgtagga 3420
tttatgtggc tcccacacgt gtgacactct cgggatggca gatgttggaa ctgtatgtga 3480
ccccagcagg agctgctcag tcatagaaga tgatggtttg caagccgcct tcaccacagc 3540
ccacgaattg ggtaagtcgg cttcagagta caagttaagc ccaaatgcat ggatacaacc 3600
caataagtca atctgatgtg acgagagaga aaacatctca gactatgttg ctacctcagc 3660
caccagcaat tttagaaggg gtagggtata ttttccacga tttcaagtat ggtcttacta 3720
ggacaggaga aagtggtaca aacatttgaa cgttgacatt tttatacttg ccctgatcaa 3780
agtgagtatg agccccaata caggttgtct aataagagag ccattgagcc tcactcaata 3840
atacagctga atgtccttct tgtctgcttc ccaggccatg tgtttaacat gccgcacgat 3900
gatgctaagc actgtgccag cttgaatggt gtgactggcg attctcatct gatggcctcg 3960
atgctctcca gcttagacca tagccagccc tggtcacctt gcagtgccta catggtcacg 4020
tccttcctag ataatggaca cggtaagatg acagctcctc tttccagatg gtgttcaacc 4080
ttccttgtgt agggctctct ctggctaagt gagctccatg gctcttgctc atttcccctc 4140
cttcagagtt ttctctggca ggatcataag tagtagatct ttacctccat tgcatcctgc 4200
tcccaaagtc cattcattca taaacaataa cttctcgcca ttgtaaaatc agaagtcccc 4260 
tattgaggat aacgtctcga taaaaatcta aagttcccta gcattgattt tcccaaaaat 4320
gcatgatttc accaaacatg tattaataat tgcctctttt ttcttttcct tttttttttt 4380 
tattatttta ggggaatgtt tgatggacaa gccccagaat ccaatcaagc tcccttctga 4440 
tcttcccggt accttgtacg atgccaaccg ccagtgtcag tttacattcg gagaggaatc 4500
caagcactgc cctgatgcag ccagcacatg tactaccctg tggtgcactg gcacctccgg 4560 
tggcttactg gtgtgccaaa caaaacactt cccttgggca gatggcacca gctgtggaga 4620 
agggaagtgg tgtgtcagtg gcaagtgcgt gaacaagaca gacatgaagc attttgctgt 4680
gagttttccc aatgaaacat atccgtttgc aactcagggt tgagaagggc aaagtgatgg 4740
tttagttcct ttcctagaca aactcctcta cctgtgtcct gtagtgggac tatgagatgg 4800 
tagcgtattt tgagaatrga ttgtctgttt tacatttttc tctgattccc taaaargtct 4860 
ttatagttct aacactgata tctgtatctc catttagact cctgttcatg gaagctgggg 4920
accatgggga ccgtggggag actgctcaag aacctgtggt ggtggagttc aatacacaat 4980
gagagaatgt gacaacccag tcccaaagaa cggagggaag tactgtgaag gcaaacgagt 5040 
ccgctacagg tcctgtaaca tcgaggactg tccagacaat aacggtgagt catactggac 5100 
ttcagctctc agaaaccggg caaaggcggc gtgccacaac atgtggttgg aagttggaaa 5160 
ctgggaacat catcgccgtc gttctctttt caggaaaaac gttcagagag gagcagtgcg 5220 
aggcgcacaa tgagttttcc aaagcttcct ttgggaatga gcccactgta gagtggacac 5280 
ccaagtacgc cggcgtctcg ccaaaggaca ggtgcaagct cacctgtgaa gccaaaggca 5340 
ttggctactt tttcgtctta cagcccaagg taggtgcttt tacacttgaa tctttgcaaa 5400
ggagcctcag ctgggcttgc tgccatgcca tacaaatgtt tgggctgtct ttacctattg 5460
atctgtgttc cgttttgaat ttggaatact tctaaatgca ggaacaactc cttgcttrgg 5520 
gatttgttgt tgccttctgt tgggaaggaa gcttaaatct agctagcact taaaagagtc 5580 
ttgcatgtgt ttaatattgc ttctctatcc ccaaagaatg gccctttgaa aactcaagag 5640 
ccctctctgt ataactaggt ttcacataca aaaattcatg gttagataaa ttatatatta 5700 
acatggcacc caggagtttt agaaagtagt ccaaagtact tgttactggg tacctagcag 5760
ccgcacatac gagcacacta actaaggtaa gagtttgaga attaaaaatt catcgttgga 5820 
acatgtactt tgaccaaaga gactcgccat ttcttttggt gttttgcaga aaggataaat 5880
cctgctttga agaagaaaat tgaatgaaat ttgcttaagc ttgtcatgta ttcttagcat 5940 
tataagatag caaactatat ccaagttgtg gatgaagtat ttagcaagtg atttataaag 6000 
taccttcaac tacagcatat tattctaggt actgaccatg gaacaataat cagtgtgaca 6060 
gtgaaccctg cttccattga cctaggccag caaatatata aaatcaagac atttataagc 6120 
cttacagata gctatatgaa ctgttgaaaa agccaaaatg aaagtgaaca tgtggcacgt 6180 
gacaaggaga ctacttgtag cctgggagga gagcattccc agttgccatc acatcagatg 6240
tttaaccacc atggtgcatg ttgtctccac aggttgtaga tggcactccc tgtagtccag 6300 
actctacctc tgtctgtgtg caagggcagt gtgtgaaagc tggctgtgat cgcatcatag 6360 
actccaaaaa gaagtttgat aagtgtggcg tttgtggagg aaacggttcc acatgcaaga 6420 
agatgtcagg aatagtcact agtacaaggt gagtttcaga acgctcactt ctgcagtaga 6480 
cacgctgtgt tgctcagttg gtccctagca tctacaagac cttgggttca atccgcatgc 6540
atgtacctgt agtcccagtg tatgggagac agagacaagt gtgacaagac ggtcagatgt 6600 
tcaggtcatc tttgctacat agtgactttc agttcacctt ggggaacatg aaaaacctga 6660
ctggaaacac aaacacacac aaaacaatta acccaggtac ttcatgtaat cccagtgttc 6720 
agtaggctga cttgggagga tggttgctat aaggcctagg ttagcttggt ctacataatg 6780 
agttccagta taacctggcc cacaagtgaa ccctaaagtt aattaatcga cacatgaaac 6840 
aaaacacatg ctttggagac cctgtaattt tgatatacga ttttgtagga ctaaggaaaa 6900 
gtcacattta aaagaattgc ctatttttaa agcaatgtga ttgattaact cattgaaaga 6960 
catatacctg ttttctttgt ccacagacct gggtatcatg acattgtcac aattcctgct 7020 
ggagccacca acattgaagt gaaacatcgg aatcaaaggg ggtccagaaa caatggcagc 7080 
tttctggcta ttagagccgc tgatggtacc tatattctga atggaaactt cactctgtcc 7140 
acactagagc aagacctcac ctacaaaggt actgtcttaa ggtacagtgg ttcctcggct 7200 
gcgctggaga gaatccgcag ctttagtcca ctcaaagaac ccttaaccat ccaggttctt 7260
atggtaggcc atgctctccg acccaaaatt aaattcacct actttatgaa gaagaagaca 7320
gagtcattca acgccattcc cacattttct gagtgggtga ttgaagagtg gggggagtgc 7380 
tccaagacat gcggctcagg ttggcagaga agagtagtgc agtgcagaga cattaatgga 7440
caccctgctt ccgaatgtgc aaaggaagtg aagccagcca gtaccagacc ttgrgcagac 7500 
cttccttgcc cacactggca ggtgggggat tggtcaccat gttccaaaac ttgcgggaag 7560
ggttacaaga agagaacctt gaaatgtgtg tcccacgatg ggggcgtgtt atcaaatgag 7620 
agctgtgatc ctttgaagaa gccaaagcat tacattgact tttgcacact gacacagtgc 7680
agttaagagg cgttagagga caaggtagcg tggggagggg ctgatacact gagtgctgga 7740 
gggatccagt gagtcaaacc agtaagcagt gaggtgtggc aaggaggtgt gtgtagggga 7800 
tacatagcaa aggaggtaga tcaggacact accctgccag ttacattctg ataaggtagt 7860
taatgaggca cagtagcatc tgaaagacca tacagagcac taaggagccc caaagcacta 7920
ttagtatctc ttttcttata tctatcgccc aaataatttt cagagtctgg cagaagccct 7980 
gttgcactgt actgactaga tacttcttat cacaaagatt gggaaaggca aagcagaaag 8040 
atggtaagac tgggtttcaa acaaggcttg gtttctatca ctggaggcaa ggaggagggg 8100 
acaaacaaga tcattattcg aagtcgctgg ttgctgtggt tttacggaag gttgatgcar 8160 
cattcctatc aacagtgaaa agttcagctt gttcaacgtg acagaaaggc tcatctccgt 8220 
gaaagagctc ctgatttctt cttacaccat ctcagttctt aactataatt catgttgagg 8280 
tagaaacaat tcatctattt ataaaatgta cattggaaaa aaaaaagtga agtttatgag 8340 
gtacacataa aaactgaagg aaacaatgag caacatgcct cctgctttgc ttcctcctga 8400 
ggtaaacctg cctggggatt gaggttgttt aagattatcc atggctcaca agaggcagta 8460
aaataataca tgttgtgcca gagttagaat ggggtataga gatcagggtc ccatgagatg 8520 
gggaacatgg tgatcactca tctcacatgg gaggctgctg cagggtagca ggtccactcc 8580 
tggcagctgg tccaacagtc gtatcctggt gaatgtctgt tcagctcttc tactgagaga 8640
gaatatgact gtttccatat gtatatgtat atagtaaaat atgttactat gaattgcatg 8700 
tactttataa gtattggtgt gtctgttcct tctaagaagg actatagttt ataataaatg 8760 
cctataataa catatttatt tttatacatt tatttctaat gataaaacct ttaagttata 8820 
tcgcttttgt aaaagtgcat ataaaaatag agtatttata caatatatgt taactagaaa 8880 
taataaaaga acacttttga atgtgtatgc ctattttctg gagtgggart aacttctggg 8940 
caagaaatct gatgagacac aaacattgga cttcaagaca gttttaaaat ttgggtaaat 9000 
gaactgtatt tcctgtttat agacgtacta ataaaaaaga agttgatgat gtctttagtg 9060 




gtaagattgt tactaatgtg gttggcaaat tgctgtaaag agccagatag taagcattta 9120
tggcattgta ggctatcttt cctgccacaa ccatgtgaca gtgagtgctt tgtaggactg 9180
agagcagcca taaatgacat gtaaatgata aactgtggct gtgctttaat aaaactttat 9240
ttacaaaa 9248

<210> 22 
<211> 5722 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 22 

ggacgcacag gcattccccg cgcccctcca gccctcgccg ccctcgccac cgctcccggc 60 
cgccgcgctc cggtacacac aggatccctg ctgggcacca acagctccac catggggctg 120 
gcctggggac taggcgtcct gttcctgatg catgtgtgtg gcaccaaccg cattccagag 180 
tctggcggag acaacagcgt gtttgacatc tttgaactca ccggggccgc ccgcaagggg 240 
tctgggcgcc gactggtgaa gggccccgac ccttccagcc cagctttccg catcgaggat 300 
gccaacctga tcccccctgt gcctgatgac aagttccaag acctggtgga tgctgtgcgg 360 
gcagaaaagg gtttcctcct tctggcatcc ctgaggcaga tgaagaagac ccggggcacg 420 
ctgctggccc tggagcggaa agaccactct ggccaggtct tcagcgtggt gtccaatggc 480 
aaggcgggca ccctggacct cagcctgacc gtccaaggaa agcagcacgt ggtgtctgtg 540
gaagaagctc tcctggcaac cggccagtgg aagagcatca ccctgtttgt gcaggaagac 600 
agggcccagc tgtacatcga ctgtgaaaag atggagaatg ctgagttgga cgtccccatc 660 
caaagcgtct tcaccagaga cctggccagc atcgccagac tccgcatcgc aaaggggggc 720
gtcaatgaca atttccaggg ggtgctgcag aatgtgaggt ttgtctttgg aaccacacca 780 
gaagacatcc tcaggaacaa aggctgctcc agctctacca gtgtcctcct cacccttgac 840 
aacaacgtgg tgaatggttc cagccctgcc atccgcacta actacattgg ccacaagaca 900 
aaggacttgc aagccatctg cggcatctcc tgtgatgagc tgtccagcat ggtcctggaa 960 
ctcaggggcc tgcgcaccat tgtgaccacg ctgcaggaca gcatccgcaa agtgactgaa 1020 
gagaacaaag agttggccaa tgagctgagg cggcctcccc tatgctatca caacggagtt 1080 
cagtacagaa ataacgagga atggactgtt gatagctgca ctgagtgtca ctgtcagaac 1140
tcagttacca tctgcaaaaa ggtgtcctgc cccatcatgc cctgctccaa tgccacagtt 1200 
cctgatggag aatgctgtcc tcgctgttgg cccagcgact ctgcggacga tggctggtct 1260 
ccatggtccg agtggacctc ctgttctacg agctgtggca atggaattca gcagcgcggc 1320 
cgctcctgcg atagcctcaa caaccgatgt gagggctcct cggtccagac acggacctgc 1380 
cacattcagg agtgtgacaa aagatttaaa caggatggtg gctggagcca ctggtccccg 1440
tggtcatctt gttctgtgac atgtggtgat ggtgtgatca caaggatccg gctctgcaac 1500 
tctcccagcc cccagatgaa tgggaaaccc tgtgaaggcg aagcgcggga gaccaaagcc 1560 
tgcaagaaag acgcctgccc catcaatgga ggctggggtc cttggtcacc atgggacatc 1620
tgttctgtca cctgtggagg aggggtacag aaacgtagtc gtctctgcaa caaccccgca 1680 
ccccagtttg gaggcaagga ctgcgttggt gatgtaacag aaaaccagat ctgcaacaag 1740
caggactgtc caattgatgg atgcctgtcc aatccctgct ttgccggcgt gaagtgtact 1800 
agctaccctg atggcagctg gaaatgtggt gcttgtcccc ctggttacag tggaaatggc 1860
atccagtgca cagatgttga tgagtgcaaa gaagtgcctg atgcctgctt caaccacaat 1920 
ggagagcacc ggtgtgagaa cacggacccc ggctacaact gcctgccctg ccccccacgc 1980
ttcaccggct cacagccctt cggccagggt gtcgaacatg ccacggccaa caaacaggtg 2040 
tgcaagcccc gtaacccctg cacggatggg acccacgact gcaacaagaa cgccaagtgc 2100 
aactacctgg gccactatag cgaccccatg taccgctgcg agtgcaagcc tggctacgct 2160 
ggcaatggca tcatctgcgg ggaggacaca gacctggatg gctggcccaa tgagaacctg 2220 
gtgtgcgtgg ccaatgcgac ttaccactgc aaaaaggata attgccccaa ccttcccaac 2280 
tcagggcagg aagactatga caaggatgga attggtgatg cctgtgatga tgacgatgac 2340 
aatgataaaa ttccagatga cagggacaac tgtccattcc attacaaccc agctcagtat 2400 
gactatgaca gagatgatgt gggagaccgc tgtgacaact gtccctacaa ccacaaccca 2460 
gatcaggcag acacagacaa caatggggaa ggagacgcct gtgctgcaga cattgatgga 2520 
gacggtatcc tcaatgaacg ggacaactgc cagtacgtct acaatgtgga ccagagagac 2580 
actgatatgg atggggttgg agatcagtgt gacaattgcc ccttggaaca caatccggat 2640
cagctggact ctgactcaga ccgcattgga gatacctgtg acaacaatca ggatattgat 2700 
gaagatggcc accagaacaa tctggacaac tgtccctatg tgcccaatgc caaccaggct 2760 
gaccatgaca aagatggcaa gggagatgcc tgtgaccacg atgatgacaa cgatggcatt 2820 
cctgatgaca aggacaactg cagactcgtg cccaatcccg accagaagga ctctgacggc 2880
gatggtcgag gtgatgcctg caaagatgat tttgaccatg acagtgtgcc agacatcgat 2940 
gacatctgtc ctgagaatgt tgacatcagt gagaccgatt tccgccgatt ccagatgatt 3000 
cctctggacc ccaaagggac atcccaaaat gaccctaact gggttgtacg ccatcagggt 3060 
aaagaactcg tccagactgt caactgtgat cctggactcg ctgtaggtta tgatgagttt 3120 
aatgctgtgg acttcagtgg caccttcttc atcaacaccg aaagggacga tgactatgct 3180
ggatttgtct ttggctacca gtccagcagc cgcttttatg ttgtgatgtg gaagcaagtc 3240 
acccagtcct actgggacac caaccccacg agggctcagg gatactcggg cctttctgtg 3300 
aaagttgtaa actccaccac agggcctggc gagcacctgc ggaacgccct gtggcacaca 3360 
ggaaacaccc ctggccaggt gcgcaccctg tggcatgacc ctcgtcacat aggctggaaa 3420 
gatttcaccg cctacagatg gcgtctcagc cacaggccaa agacgggttt cattagagtg 3480
gtgatgtatg aagggaagaa aatcatggct gactcaggac ccatctatga taaaacctat 3540 
gctggtggta gactagggtt gtttgtcttc tctcaagaaa tggtgttctt ctctgacctg 3600 
aaatacgaat gtagagatcc ctaatcatca aattgttgat tgaaagactg atcataaacc 3660 
aatgctggta ttgcaccttc tggaactatg ggcttgagaa aacccccagg atcacttctc 3720 
cttggcttcc ttcttttctg tgcttgcatc agtgtggact cctagaacgt gcgacctgcc 3780 
tcaagaaaat gcagttttca aaaacagact catcagcatt cagcctccaa tgaataagac 3840 
atcttccaag catataaaca attgctttgg tttccttttg aaaaagcatc tacttgcttc 3900 
agttgggaag gtgcccattc cactctgcct ttgtcacaga gcagggtgct attgtgaggc 3960 
catctctgag cagtggactc aaaagcattt tcaggcatgt cagagaaggg aggactcact 4020
agaattagca aacaaaacca ccctgacatc ctccttcagg aacacgggga gcagaggcca 4080 
aagcactaag gggagggcgc atacccgaga cgattgtatg aagaaaatat ggaggaactg 4140 
ttacatgttc ggtactaagt cattttcagg ggattgaaag actattgctg gatttcatga 4200 
tgctgactgg cgttagctga ttaacccatg taaataggca cttaaataga agcaggaaag 4260
ggagacaaag actggcttct ggacttcctc cctgatcccc acccttactc atcaccttgc 4320 
agtggccaga attagggaat cagaatcaaa ccagtgtaag gcagtgctgg ctgccattgc 4380 
ctggtcacat tgaaattggt ggcttcattc tagatgtagc ttgtgcagat gtagcaggaa 4440
aataggaaaa cctaccatct cagtgagcac cagctgcctc ccaaaggagg ggcagccgtg 4500 
cttatatttt tatggttaca atggcacaaa attattatca acctaactaa aacattcctt 4560
ttctcttttt tccgtaatta ctaggtagtt ttctaattct ctcttttgga agtatgattt 4620 
ttttaaagtc tttacgatgt aaaatattta ttttttactt attctggaag atctggctga 4680 
aggattattc atggaacagg aagaagcgta aagactatcc atgtcatctt tgttgagagt 4740 
cttcgtgact gtaagattgt aaatacagat tatttattaa ctctgttctg cctggaaatt 4800 
taggcttcat acggaaagtg tttgagagca agtagttgac atttatcagc aaatctcttg 4860 
caagaacagc acaaggaaaa tcagtctaat aagctgctct gccccttgtg ctcagagtgg 4920
atgttatggg attccttttt tctctgtttt atcttttcaa gtggaattag ttggttatcc 4980
atttgcaaat gttttaaatt gcaaagaaag ccatgaggtc ttcaatactg ttttacccca 5040 
tcccttgtgc atatttccag ggagaaggaa agcatataca cttttttctt tcatttttcc 5100 
aaaagagaaa aaaatgacaa aaggtgaaac ttacatacaa atattacctc atttgttgtg 5160 
tgactgagta aagaattrtt ggatcaagcg gaaagagttt aagtgtctaa caaacttaaa 5220 
gctactgtag tacctaaaaa gtcagtgttg tacatagcat aaaaactctg cagagaagta 5280 
ttcccaataa ggaaatagca ttgaaatgtt aaatacaatt tctgaaagtt atgrrttttt 5340
tctatcatct ggtataccat tgctttattt ttataaatta ttttctcatr gccattggaa 5400
tagaatattc agattgtgta gatatgctat ttaaataatt tatcaggaaa tactgcctgt 5460
agagttagta tttctatttt tatataatgt ttgcacactg aattgaagaa ttgttggttt 5520
tttctttttt ttgttttttt tttttttttt tttttttttg cttttgacct cccattttta 5580
ctatttgcca ataccttttt ctaggaatgt gctttttttt gtacacattt ttatccattt 5640
tacattctaa agcagtgtaa gttgtatatt actgtttctt atgtacaagg aacaacaata 5700 
aatcatatgg aaatttatat tt 5722 

<210> 23 
<211> 42521 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 23 

gatcgttttc cagacatttt tgttctctgt tcatttcctt atcgtattca aaaagttatc 60 
acaaatgacc ttctctatct gtctgcgtct cttttaactc tcaccgtttg ggacctttca 120 
aatagttttt cgctatcaaa tctaaacatt agttgcgttg actcgacatt tgacccctca 180 
ctatcatctc cagttctctt ttttgttaca ctttagcagt ggcagcagag agcaagtagg 240 
tggagccaaa gtgtgcgcca ttcatcggga aaaattgtgt tcttcatcaa attttgggca 300 
atttactcgg gatttgcgct aatttggaaa caaaaattca aattcctgcc aattgttttg 360 
tttgcttttt ttcttttttt tttgctcctc ccatctctca tcaaattgct cttttttcga 420 
ttctaacata tcagccatct tcagagtgtg tcactaaccc ccatttttat tcaaggttag 480 
tgatatagta tcctaactac agacgtcaca ccatgaggtt gctgctcttc tcggcagccc 540 
ttcttctgtg ctccgtccca acgtgggcct tctctctgtc atcattcttc ggaagcgatg 600 
ttgcacaagt aagcaagctc tcctatacct agaatcttgt aaattgaaaa ctctaatttc 660 
cagaagccat accttcatcc aaactcccca ccggagcgtg acccggcgag ttccagaatg 720 
aagagacagg catatcaagt gtacgttgat ggagatgttt ccgttactgt tgacaagtct 780
ggacaaaagg aaaccggcaa ctggggacca tgggtgcccg agaacgagtg ctcacgttcg 840
tgtggtggag gagttcaact cgagaagaga cagtgcaggt tcgtggactr ttcatttttt 900 
agggaatttc ctagacgttc taaaagctta ttttcaaaaa ttttggtttc ctgatcttca 960 
tgcctttatg aacgtggtga aagatcaacc taggctagcc tgtgacatac attttttgaa 1020 
gcagatccaa ctttatcaag agccatcgaa ttctcgtttt aaagtgtttt ttttttctga 1080 
taactttttt ctaatagctt tacccatttt tatgtcaaga ctgaaagcaa tgaatcacaa 1140 
gaggctatct acgtttgttt ttgaagctct gtaggaatca tcttaaaaaa ttaagtaaag 1200 
taatggagat gaaattctaa ttttttaaaa tcataatcat tactttctgt attatcttca 1260 
agttcaaact tttcaaacgg ttattctcaa gaaactcaca tagaatttta acaatttcct 1320 
ctatctattt cttgcaagca acccaccgaa ctcaaatctt atccaaacta aacttttagt 1380 
ggtgactgca ctggagcttc agtccgctac atctcgtgta acttgaacgc atgcgagtct 1440 
ggtactgatt tccgtgctga gcaatgctcc aaattcaacg atgaggctct tgatggaaac 1500 
taccacaagt ggactccata caagggaaag aacaagtaag ttaactttct tcaagatgtt 1560 
tttctaattt tcgagttttc aggtgcgagc tcgtctgtaa gccagaatct ggaaacttct 1620 
actacaagtg ggctgataag gttgttgatg gaaccaagtg cgactccaag agcaacgata 1680 
tctgtgttga tggggaatgt cttccagttg gatgtgacgg aaagcttgga tcttgtaagt 1740 
ttaaaattta attcaaaatc ttcatttcat gccgaatatt tcagcrctca aattcgacaa 1800 
gtgcggaaag tgcgatggag atggttctac ctgcaagact attgaaggac gtttcgatga 1860
gcgcaatctc tctccaggat accatgatat tatcaaactt ccagaaggag ccaccaacat 1920
taagattcag gaagccagaa agagcaccaa caacttggct ctgaagaacg gttccgatca 1980
cttttatttg aatggaaatg gattgatcca agttgagaag gaggttgaag tcggaggaac 2040
tatcttcgtt tacgatgacg ctgaaccaga aacrctcagt gctcaaggac cactciiccga 2100 
ggagctcacG gttgctcttc tcttcagaaa gggaagccgt gatactgcta tcaagtacga 2160 
gttctctatt ccacttgagg aggaagttga ctacatgtac aagtttgaca actggactcc 2220 
gtgctctgta tcatgcggaa agggtgttca aacccgtaat ctctactgta ttgatggaaa 2280 
gaacaaggga cgcgttgagg atgatctctg cgaggagaac aatgccacaa agccagagtt 2340 
cgaaaagagc tgtgaaactg ttgactgtga agccgaatgg ttcactggag actgggaatc 2400 
ttgctcatcc acctgcggag atcaaggaca gcaataccgt gtcgtcract gccatcaagt 2460
attcgctaac ggacgtcgtg ttaccgttga ggatggaaac tgcaccgttg agagaccacc 2520 
agtaaagcag acttgcaatc ggtaagttga ttttataaat gcataaacaa ctctgtgaat 2580 
ctatttgttt atgcgatgct atccatatat attaccagat ggtgttggtg cccaaaactt 2640 
ataaacaatt attttctctt tgcagttttg cctgcccaga gtggcaagct ggtccgtggt 2700 
cggcttgctc agagaagtgt ggagacgcct tccaatacag atcggtgacc tgccgcagtg 2760
agaaggaagg agaagaggga aaactcttgg ccgctgatgc ttgcccagct gatgagcaag 2820 
agaagttcga cacagagaga acttgcaatt tgggaccatg cgagggactt acatttgtca 2880 
ctggagaatg gaacttggtt agattttgca aaatatgggg acctggggaa aagcatacta 2940
aataagatca actttatgaa acaaataatt tttagtgcac ccgctgcaac gatactgagg 3000 
agactcgtga agtcacctgc aaggactccc aaggaagagc ctatccactc gagaagrgtt 3060 
tggttgataa ctccaccgag attccaactg atactaggtg agtcattcca gatatgacat 3120 
tgaacttgga ttaatttttt tcttccagat catgcgccac ccaaccacca tgtgagtacg 3180 
agtggaccgt cagtgagtgg agcaagtgta ccaccgaatg cggacacgga cacaagactc 3240
gtcgtgttat ctgtgccatc caccaaaacg gaggactcga ggttgttgat gaaggacact 3300
gtcaagctga gaagccagaa ggaaagacta actgcaccaa tgaggagaag tgtactggaa 3360 
catggtacac atcttcatgg tccgagtgta ccgctgaatg tggtggtgga tcccaagatc 3420 
gtgtcgctgt ttgcttgaac tacgataaga agccagttcc agaatggtgc gacgaagccg 3480
tcaagccatc tgagaaacaa gattgtaacg ttgatgactg cccaacttgc gttgactctg 3540 
agttcggatg ctgcccagat aactctactt ttgctaccgg agaattcaac ttcggatgct 3600 
ctaactgctc ggaaacagaa ttcggatgct gtgctgacaa tgttaccgtt gccactggac 3660 
ctaactccaa gggatgcgaa gaattcgrtg agtctccact taaccttgaa gctgatgttg 3720 
ccaatgctga cgctgaagct tcaggagatg ctccagaact ctgcagcgtc acaaacgaga 3780 
acggagaagc tgttgatgtt gagtgtgcca ccattgctcc aatcactgct cttcttggag 3840 
atggggaact tatcggaaat gatactgatg cttccaatga gaccatacac tgctcgaaga 3900 
ccgaattcgg atgctgtcca gattggtaca ccgccgcctc tggaaagggt aacgaaggat 3960 
gcccatcgtt cactcttgga ggatgtaacg agactcaatt cggatgttgt cacgatgatg 4020 
tcactcttgc tcgtggagcc aaccttgaag gatgcggaga gccatcttgc gctgcttccc 4080 
tctatggatg ctgtaaagat cgtaagacaa ttgccttcgg accacactat tctggatgtg 4140 
agcgatcatc cttcccatgt gagcttagcg acttcggatg ctgcccagat ggtgagactg 4200 
ctgctcttgg aaagaatgga accggatgcg gagagaactg cttgaccacc aagttcggat 4260
gctgccctga tggaaagacc accgccaagg ggtcccacaa cgagggatgc ggatgcgagt 4320
tcgcccaata cggatgctgc ccagacggaa aatcagttgc caagggagcc ggattttacg 4380
gatgcccaga aagctgcgcc cagagccagt tcggatgctg cccagacgga aagactcgtg 4440
ctcgcggaga gaacaaggaa ggatgtccat gccagtacac ccgttacgga tgctgcccag 4500
atggggagac tactgctctt ggaccacgca atgatggatg tgataactgc cgctacgcca 4560
agcacggatg ttgcccagat ggagagacca aggctcttgg accagatgga gccggatgcc 4620
caccaactac cacgccacca ttcctcatgg gaggaactgt tgccccacat aaaatcgccg 4680
cctgtaatca gacacaagaa agtggaaccg tctgcggagc cggatacaag cttgtaagta 4740
attaacctca tgaaaaagaa ttggagcaac acatttcatg tataaatatt tcaatttcag 4800
gcatggcatt atgataccac tgagggacgt tgcaaccagt tctggtacgg aggatgcggt 4860
ggaaatgaca acaactttgc tagccaggat atgtgcgaga ctatctgcgt cgaaccacca 4920
ggcaagggaa gatgttacct gccacgtgtt gatggaccac tccggtgtga ccaacttcag 4980
ccaagatact attatgatca ttccaagaag cactgtgtgg ccttctggtg gagaggatgt 5040 
ctcggaaatg ccaacaactt caactctttc gaagaatgct ccatgttctg taaggacgtt 5100 
ggaccgtacg atgctccaac caccgctgct ccaccaccac caccacagca aaatgctcag 5160 
caataccttc caactccaga agttcaacag attgagattc aatctgctga gcaacctcaa 5220 
ccacaacagc cacaacaaca gcaacagcaa caacagcaac aaccacagca accacgtcaa 5280 
tcaatggaag acatctgcag atcccgccaa gacgccggac catgcgagac ttactccgat 5340 
caatggttct acaacgcttt cagccaagaa tgcgaaacct tcacttatgg aggatgtgga 5400 
ggaaatctca atcgtttccg cagcaaggat gaatgcgagc agcgttgttt cttcgttcac 5460 
ggagctcagc catccgctgc ccggcaggaa caagctcagc cagcagctca accagctcaa 5520 
ccagctcagc caagtaacat cgtctctcca ccacaacagt cagctagtcc agttgtggtt 5580 
ccatgtaagt tctttagaat gcatttattt cttactataa gtttctataa gttcgcatgt 5640
gaagcatccc catttcagcg aacagcaaac aacgcgatgc ttgccacctc aacgttgacc 5700
aaggacgttg taagggggct tttgactcct ggtactacga agttgccacc ggatcctgcg 5760
tcacattcaa gtacaccgga tgcggaggaa acgccaacag atttgctagc aaggatcagt 5820 
gcgagtcact ctgtgtgaag ccagcttctg aagctgcttc agccggaatt ggtatgcttt 5880 
gagttataga gaatgttcac tatttttgtt aaatgtttga gtaaatgaga aactggctca 5940 
gtttgaaaat gtttgcacca tgtttcaaaa tagtttttga gttgaatagt tgaggccatg 6000 
aaaatcttaa ttacactcca gaagtacatt ttaaaacatt tttgagaatt aggtcttcaa 6060 
aaaaaggttt aatattgagg tttcaaatta gaaatattaa tatacgggga tttgggttta 6120 
aaactgattt ttaaaatctt atttttgaag tttcgctttg atattcgtgc aaaaaaaaaa 6180 
ccaacttttt cagacggtgc agctggaatc aactcagttt gtgacgaagc caaggacacc 6240 
ggaccgtgca ccaactttgt cacgaaatgg tactacaaca aagccgacgg aacctgcaac 6300 
cgattccatt acggtggatg ccaaggaacc aacaatcgat tcgacaacga gcaacagtgc 6360 
aaggctgctt gtcaaaatca taaggatgct tgtcaacttc caaaggbtca aggaccatgc 6420 
tctggaaagc attcctatta ttactacaac actgccagtc atcaatgcga gacgttcact 6480 
tatggtggct gcctcggaaa tactaacaga ttcgctacca ttgaggagtg tcaagcgaga 6540 
tgcccgagta agttctaagt taatagtgat atatgctttg tttccccttt attctttgac 6600 
aattttcaaa tactttttgc ataattacct tatttctatt cccttctgtt tcccattttc 6660 
ctccacccgc tacaaattgt ttcccgtact ctctcctttc tcactttccc gtccgaaggg 6720 
acacggcaat gctgcctaaa tgaactgcct aataatattt atgaattttc caattttcta 6780 
aaaaaaaaca attctctcaa aaaattccct gccgttccgc cactgctttc ttcacccatt 6840
gttgcgctat tttttttaaa taaatgaata aagctgaaat agttaacagt ttctgaaatt 6900
gcatgtaagt ttgtagtgta tcagtgtgtt tgtcgtgaaa gttttttttt acctgcatga 6960
tttcctgaac tgcatgaaac tgttcttatt acgttttaga tttgctgaag tgtgctagaa 7020 
gtgtgatttt gtttcagaag acgaccagac tacaacaaca tcacaaccag aagagctccc 7080 
aagtttgcca cttgttcaag aagatcctca gccacgaccg gcattttcat tgaagtaagc 7140 
acgtgtagtc caagtgccta cttctcgtat gaccaaaaaa tttaatataa ggtttccaag 7200 
tattaaggaa tcagtagcat gtaaattgtg tggattgttc tcctgggttg atgggttttt 7260 
ttctcactca caatcagata tggagtagct tatatgggaa tttatttgag aaatagaata 7320 
tgtcataaca tccaaattta attattaaaa agttgtgaag tttctcatta tgtatataaa 7380 
attcgccttt caaataagaa caaaaattaa ctgtatgaaa gagctgaatt caatttgaaa 7440 
ttgagaaaat aactggttca aaaagaagaa aaacgttgga aaatctagac gtaaatctat 7500 
ggattttctt ttcaggtcgg ggaaatttcg acgattttta tattttcaaa aatcattcac 7560 
aaatatacac caaaaattat ttttaccata ataaaatacg gaatttcact ggattactgt 7620 
agtattcatg taaggttact gtattgttac tctagggata ctacaagaat atttttgcaa 7680 
agttgtaaga agtatagaga ttactgtaga ttgaaaatct agacaaaaat cattttccgt 7740 
aataatctgt ggggatagaa tgttgaaggc acaaggctta taaagcacca tgggaaaaaa 7800 
ttttaacagt gattttttta agcatatcct ctttcccagg aaatccactt ttcaaatata 7860 
ttcccactaa actctttaag acaatccttc gcccatagtc gtcgccgtga tgctccattt 7920 
gcacgttccg tatccgcccg tcaccatact cctgattccg aagaggaacg agttgactgt 7980
tatgctgttc cagatccagg atcttgcggg taataaatct cacctatcca ttacaaccat 8040
taccgtctta atgattcaga gactaccgtc ttgtttggca ctactctgcc acgagtaact 8100
catgccgtca attctactat ggtggatgtg ctgggaatac gaatcgcttc gagacccggg 8160
ataaatgtga aacatcgtgt gttgctaaga ttgaagaacg cgtggaaagt gtgtcagaag 8220
cttcaaaatc tctggaagag gttagactaa cggatccaag gatggattct cactttggat 8280 
atcatgatcc agaagttgat caaatcgaag aagaagctga atatgtcatt gtrgataccg 8340 
gagctctacc tgaattatgc atgcttccag aacaaagagg gtcttgttat gataacattt 8400 
tgagatggag gtaagtcaaa tcaagaatag aaaattcgaa aatccgaaaa actttataat 8460
tatactaaaa gcaaaatctt aaaatctttc agattcgact ctgaaaagtc tcaatgtgta 8520 
accttcatgt attctggatg taatccaaat gcaaatcact tcactagtca ggttagtttc 8580 
attattttgt gtcctttcgt ggaactggcc ccttggtttc taacttgatc ttctccttcc 8640 
gaatacccaa tttgagcacc gctggctcac tttttcgacg gtgacgttcc tcaattctag 8700 
cggcctctgt attttctgag cactcttgag caacagtttc ctcactggaa atgtttgttt 8760 
ttcaagaggg agtgagagag agaaataaac gtacaatttt tgaagccgca catgatttgt 8820 
tagaagtcga tgccgttctg cagtatcctt catgtttcgt agttgtttct gtagtaattt 8880 
ttatggatta ggaactaaga aatcatcact cactgcggta gttgcatttt tgtgcatgca 8940 
tcttcccata aaagcaacaa atgcaacaac tgatagagcc gccacacaaa ttgcaataat 9000 
tcgaagtcga tttctaattc ctttctttac tttttgtcta tgctcagctg ctttttcgat 9060 
gtgcttcttc ttgctggggt cgagctcgca atgaggaaat ggttcgatga gtggaccgtg 9120 
ttttttgcat tgttcacaac ggcgtccagt gtatttgtct gggcagtcac acgaaagagt 9180 
gcggtttttg aaa1:ctgaaa attttaaatt taagaacagg atctatagca gttttgccca 9240 
tcacagtcct atgtctatat taaaaaaaat tatcggacat taaaaaaaat gttttctcat 9300 
tttttcagta tttctataaa aactgcattc gcatttaatc ataactttta atcgttaaaa 9360
acttagtctt taagtacctg gggatccgta aacacagaca atttcatcac aataatcgcc 9420
ttcaaatccc acatcacaga tgcatctrcc atttctcaaa aacccctcga cacatttact 9480 
tgtatattgg cattcacttc caaaatatga gcccacacat tcacatcggt cccccttcca 9540 
ttctcctttg tttccgcact gaaataattc aatagatttt ggaagtttag ggcctcaaaa 9600 
atataccttt tccgctggcc gatagtcaca catttcacct ttccatccga cttcgcaaat 9660 
gcacggctca ctgaatagaa ggctttccgg gtcgaagcgg aatccgaccg agagtccgtg 9720 
aactgtaaat tgaaaatttg taattccaaa aaaaaaacag cttttgcaaa aatcgtccaa 9780 
aagaatttta gagttagaca ttatttttct caaaaagttc aaagttgtat cagttttaaa 9840 
ataaaatatt taataggatt gtagagcttg ttagaaaaaa taaaagctac ttgaaaaaag 9900 
aaaggtatcc aaaaaggtat tgagatagtt tcaagcaact ctatttgtaa actgtcgagt 9960 
ttttaagttc tacaaatctc ttataacatc gctacatcta ctatcaaact ttgaaaaaaa 10020 
accataccac attcaaaatg ttcacattta tctccagtct gtcccttgat acaatgacaa 10080 
atccctccag catagattcc tccattacga cattcggctc tcggatcatc cagagcaaca 10140 
ttgtctagaa tacttctctt ttgaagaata cgatgcacgt cgctcaatat attttcatct 10200 
agatctagtg agtcatctcg tgattgtgct tttgttgttg ataaaaatag gaagagtaaa 10260 
gtggaaaatt gtaaacagta catagcgtta gatactgaca agtctactat caattgattt 10320 
atttattgcg tcttgaaagg ggtatcaatg agagaaatag ggagatgggt aaaatgcatt 10380 
tataagagaa tacaaaagat gacgtaattg attaatcaga gatcagttga aaatactttt 10440 
aagtatcaat tattatctgt gaagacagtc acgtgactct gactcgaact caatttgcat 10500 
gttgatagtt ccaatgttaa agaaagtctt tgggttttct ccagatgaaa caaatgattt 10560 
tggaatatta aacgtgactc ttctctgaca aggtttgagt ccgtcatcac aatcgtgata 10620
gatattaagt tttggatcaa tagtcatcac ttcggaagtg tgtccggtaa gaaggaattg 10680
accaagagag tctgtagttc cttcggcaag aagatcgtca agatccggtc ctgaaaaaaa 10740
cttttatttt gaaaaatttc aatgagttgc ttcatgttag aatttggaat ttttaaagat 10800
gttagcaatt ggtatttaaa tgttcaagct aacgtaatta gagttattca aacaagcttt 10860
atataaaaac tttgtgtaag attcggtcta attagaacat caatttttaa cgcagctgat 10920 
aaaaaacttt aatttcaagc ttcacataat tctacttacc ggtatcatca tcgtagagct 10980 
tcaccttcgt gttagccagt ggtttgtctc cacacatcag aacaccctta actccagctg 11040 
attgggtgaa tacagcttcg gagccaattg cacaaagtat gaaaagtgat gaaatgcacg 11100 
cgagtcgtga cattattttt gtctgaaaat acaaacactg actgatctga ccttcatcgg 11160 
agaaactctc ttatagcaca gttggttaga aaaagatacg gagaggagaa gtgggaaatc 11220 
gaattgacca aacaaaagaa ctggttttca cttgaaatag aagacgatga aagatataca 11280 
acagagaaga tcggaagtga ttcatctgga gaagaaaatt gagaggagca acttcttgta 11340
ttttccactt atttatatac ccaatagaat tcacctgatt ctttccgatt tgtgtacatt 11400
tcgctgacta acgtgtgctt cttcggtttt gtcatttctt attgttcatt gaaaataaac 11460
agaacaaagc aatcataagg tcgaaaatcc catttagaga tcaagaggtg tacctttaat 11520 
tgtgcggcat ggcatagttt tatcttgctg aactctcacc aattgatgag tatgtcagta 11580 
gaatggattc catccgatcg ttgctccacg gtgatctctt ccgccgcctt ttcatccacc 11640 
atacccgttg tgtatggctg gcaactgtga acagcgcctc agtggaatgt ttagtttgat 11700 
atacagttta aaataatttt ctaaactaaa gaaatcagtt tttgaaacca gtcttgtagg 11760 
catgtcgggc gcaggcacgc taacgtgaaa aatagaattt cgagtggtta actattttat 11820
tttcaattaa aatacaatca actacacaat gaatgacccg gataaatgaa atacaaatac 11880
aagaatttaa aaaaaacatg gaaatttaaa cttttccatc atctcccttt gctggaatat 11940 
tatatttcat tcgataagct tccaattcgg ctttrctctg atcggatcgt acactgtgtc 12000 
tctcatccat ctcttgttga gctgtcattc tcttctcatt ccatttctga gcttttgctt 12060 
ttttgtggat tctgttgtat tctttgcact tgcagcagca ataacagaag caggcaatga 12120 
gaattacagc aatgattcca gcgcagatgg caataacgat tgcggcggct gatgtgctca 12180 
cccagcagac attgtatttg acatgcttga tattacagtc tggatagtac cagtcgaatg 12240 
gcatacatct tttcgttttt ccaccacacc agaagcaatt ctgaaaaaat gtgtttttga 12300 
aattttcaat atgtttgctt ataaaattga atttaatttt tcaaacagtg tttcagaaac 12360 
tcaacttctg aaattaggaa agtattctca attgagagct gtttttgtat taaaagtttc 12420 
agtttagaac tacaggtgtg aaaaaatctg agcaagtgaa caccaacgta ttgcatcaca 12480 
gtttacgcgt caatttattc gagtgttcat tgtagagaaa gttaggtcac cttccagaaa 12540 
attaagaaac ttgtttcaga catttttgct cttttagagg aatttttttt tagaggaaac 12600 
acgcaagttt ctttgaaaac aaaaacaaaa tatatttttt atccacttac cgagcccttg 12660 
ccaacacatg tttcacaagt gttcaaatcg ttcgatccaa ttctacagta ttcttgtttc 12720 
tctgaccatg tcatgttatc cgcacatact gatactagaa caattgagaa aaagagtagt 12780 
aatcggtgaa tcatcgttct gaaaaatcaa taaatagtaa caacttgagc aagtctcgta 12840 
actgagcgac aaaaccaaag tagtaatgaa atagaaagat agaaaggtaa actcaaaggg 12900 
ctcgcgtgtg tttgtctatc gagtgccaat gagttttagg agtagcgaca gaaataagtt 12960 
ggcagaagaa gaacatacga actatgtcgg gctacaagat tcttgtgttt actttttgaa 13020 
aaagaaaatg catttgagaa aatgcaaatg ttcggcagaa atcgaatgga gtttagagca 13080
gaatggtaaa aataaaggtg gatcagcaaa aatagttgaa caaatatttt gtagatttca 13140
tgaaagataa caaaaaaaaa taaatacaga aaacaatata tgacgtattt ttcaatcatt 13200
gtttttgtat agtgcaaatt cagtagttgt acctgttata agtacagcga agttatacat 13260
tttagagtgg gtcttgtcac gatccatatt ttttgaacgc aatatttgaa atccaaaaaa 13320
aaataaagaa actaggcgcc aagaagctat agtagctata cgcataaat L gtgaatacct 13380
tgaattacat taaattccaa caaaatagga aaatcatata aaaacgaagt tagttgtcaa 13440
ttcaaaaacg tttttaaaat tgttcataag cgccgagctg tccccctcag ttttcgttta 13500
ttcagctttt ctctctctct ctattctcta tcgtcaccta tatttcatag tccccttatc 13560
caaaagtgga agtgaatgag gatggaaata tgataccgca tgcttcaaaa aaatttgctt 13620
atgagaaacc aacatttgaa aatttccagg aaacttgtga acgagcctgt ggtaaatgga 13680
gaaatgtggc agtgtgcgag ttgccggccg aacacggaga ttgccaactt gcgattccca 13740
ggtatgtact gttgacacat tttacaaatg ggatgggaag tggtcggtga tcaggtggaa 13800
atgttgatgg caaggtttta aatagatgta gtaactgaaa acaaaatgac agatgtacat 13860
acataaatta ggattaaaac aaaaatacta tgcggagtca ggtgactaat ttttctggaa 13920
attccagaat ttgaaaatgt ttttctctgt ttgaaagtag aacgggacct tttacaaaat 13980
aggctgaggt aggtaggctg tagaaagtgc ctttggtgtc tttgtaattt ttgttttcaa 14040
aaaatcactt gtaagcacat gaaaatcaca tgaataatga tgtaaaattt agaaaattag 14100
tataaagaag atttacattt taataataat aattccagat ggtaccatga cccaaaaaca 14160
tcccaatgtc aaatgatgat gtggactgga tgcggaggaa atggaaacgc gttctcttca 14220
aaagcagact gtgaatctct ttgccgagtt gagacattat ggtccaacaa cactgacttc 14280
tgtacattgg aacgatcggc cggtccatgt acagattcta tttcaatgtg gtatttcgat 14340
tcaactcatc tcgattgtaa gccattcact tatggaggtt gccgtggaaa tcagaatcga 14400
ttcgttagca aagagcaatg tcagcagagc tgccgtcctg gagacacaaa atctgaggat 14460
atctgcacac tccgcccaga gccgggaccg tgtcggctgg gactcgagaa atacttttac 14520
gacccggtga tccaatcctg tcatatgttc cattatggag gttgtgaggg aaatgcaaac 14580
cggttcgatt cagagttgga ctgcttccga cgatgctcga gtgtcaaggt tgaagcaagt 14640
gaaagcgaga gagtgggaca gctgacgtct gcatccacgc cagttattta tattgttaac 14700
aaaacagcga tttttgttgg aaatactgta agttattaat tttaattcga agatttctta 14760
atatttaaac tggtcccatg agagtttggt tcattttccg acaatagact gcaaaattga 14820
taacttttca tgaacacttt agccgatbtt agctagtttt gtttattaaa atttggtaat 14880
tcaaaataaa aaccttacgc cactccactt ttgaatactt gtcaaataca ttttttcagt 14940
tccgaatccg atgcaacagt tacggagtgc ttccaataac atggtacaag aacggaggtc 15000
tcctccagtt cggctcgcga atcactgaag agaatgatga cactttggaa attgtggatg 15060
ctttaactgc tgacgccggt gtctacactt gcattgccgg ccaggatagt acaatgagcg 15120
agggagtcga ggttgtgatc aagagacttc ctggtcacag aactacatct cgtccaatgc 15180
tgacaccatc caagaacttc tccttgggaa ccccaccgac accatctcca tctacagttt 15240
ctacaacacc cttccgaatc tatacgcctg gatctgctcc atctgatgct cgtgtaagcc 15300
gcccgacaag caattcctgt atggatgtgg gtaacgcgag cacgtgcgat ttgarcgtga 15360
agaacggttt gtgcgggaag aagcgatatg gaacattctg ctgtcacact tgcacccggg 15420
ttcataattt taaattttaa gtttggattt tttgatttca aattttcatt aatcttttaa 15480
tgttttctcc ttcataatat ctccattgcg agatctcttt ttcccttctc ttcctatact 15540
ttcccctcag acaattggct aattactcgt tcgttccagt aaataaatat gaatttattt 15600
cttcttccta tactttggta tacataatca tggcatgaaa tacaagacaa aaaaaacaag 15660 
aaaaaacaat ccacttgaaa tccattcagg tgtgaactaa catcttactc tattaacttc 15720 
gtgccattac ttccacttat tttgcctatt cactaatgaa gtctctgaga attattttct 15780 
gtctaactct gctgattgca agcttcccag ctcagcggag ccgccgaaaa cagaaatttg 15840 
tacgccttcc tagtgggttc acgtttcctg cggatgcggc gagtaatttt caaagagatg 15900 
cgtatattcc agcgacggta aattttcgct ttttgttaaa tgaatttcag gcttcaaatt 15960 
attttctagg acaaaaattt aaagtaggct tgcgcatact catttccctg ccttacctgc 16020 
caacaggcta gcttttggag agaaatcaaa agtttggtgt ctgraaatct aagctttccg 16080 
aagcgtccga aagtttttgg gaatccgcta tacactttaa gattgataaa tatttgaatc 16140 
aggtttattt tgcactatta aggcgtgtag gcactaggcc ggcaaagctc gcctacgggg 16200 
agccttacaa tcaagtatta ttcatgaagg tcttgatttg gttacagaat tccatctaaa 16260 
attacttata caaaaacatg aaaaatttca gtttgccccg ccatctgaga agattcttca 16320 
agctccacca cgctatttaa ctggagaaca caatccagct tatggtaggc ccaatttttt 16380 
atctgatttt ctaaatttaa cttcaagctc acaataccga tgtgcaagga atgaactacg 16440 
ctgagtacaa gcaagcgatg gccccacaac cacatccagt cgatgcttat tctccaccac 16500 
ctcctgcacc aatggtccca ccggttactg tagttgaacc acctgcaatg ccgtatgaaa 16560 
tgactacgat tgcatctgtt ggaccactta ctactcccgc atcagtcggc ttgaagaagg 16620 
gaaagtttgt gattttagtt aattgatctt tcaagtaatt ggatacaatt tccagcatcg 16680 
gaggaattgc tcaaaacttg aacgacaggt acaccagctt aacaccagaa gctcaacgtg 16740
ctcagaaagg tcatacctat acggctctgg gcggtggaca attctatcaa agtttacttg 16800
gaggggtaag atgcaaggtt agaacttaca aactcaattc attttacaga aaggaggccc 16860 
cggaggattc tccccactct cgttctttct aaacggcggt ctaggaggta ctggtggtgg 16920 
tggtaacaat ggattcttcg ttccggtgcc tgtagtcatt ccgcctccac cgccaccgcc 16980 
accaggacca aactgtttca cgaacccgtc gggattcctt tgctgtaacg tgacacttga 17040 
gaaaactatg gaagacgcgt acctggccgc aaaagcagat ggtgcatcac tgtgcaatgt 17100 
acagaaaatg gcaactgcag tgcaagcggt ggggtttatg gatttcattt tataatgtaa 17160 
tgtgctcttc cctagaattg aataagctta caacttgaat tacgacttga attacaactt 17220 
gaataagctt aaaatatcca ccaaatttca gcaagccgaa aaaaaattcg gaacaacttt 17280 
cgaatcagtc gctgctcatt cggacttcgt cgcaaaaatt aattttgccg gtgacctgaa 17340 
ctgtaaaata gaaatcgatg ggaaattcat actagcgtac gcaactccaa tcgccgagca 17400 
agaggtgaac attgtcgatg ctagctcatt cttctcggga gctgctgata aggatttgga 17460
tggtgtcaat ggtaccaagc ccacctacat tgtctacggt cccattaaat aatggagggt 17520 
ctagctttaa agatttctgt atattaaagc tgaaatgtga attaattgtt tatttgccaa 17580 
tcacaataaa gttggaaata tcatttgaat agttcgaaag ttttcaatcg gaatgggaga 17640 
aaattcgaaa atttaggtgg aggtgaaaag ttgatgaagt aacacaatta actgtgctcg 17700 
aatcctgaat agaaggagaa aagagcctat aaacagattt tcaatttaca catattacac 17760 
aacaattcag gaagaagaca gtagttgcaa aagaaaatac gtagaaaaaa gagtgaagga 17820 
ctggcgggat gtcagtttgg atgtacaaat agaactcctg aagcataaga aacagaagaa 17880
tcgaccgatg atcgaacctg aaatggattt attgttgatt gaaaaatatt aagcaattcr 17940
gaatctctac cttgtttgat tgtgtgtaat gcaagaatct aaactcgtga gtgtgattgt 18000
tactgatccg gaaatgttcg gctgcttgca gcattatcaa tatcggatta cgcccacaaa 18060
tcgtgttctg ggtctttttg aggtagtcat taaaagctgc cggattaagc gtctcaattg 18120 
cgctcattcc ctgcttatcc atattggtta tctgctcata aatcggaata gaactatgac 18180 
gatcgtacgg agaaaagctg aagcgttctc cccaatggca aaagtccgaa gagatcacaa 18240 
acaagtttct tggatcctcc atgtaatgag caaaaatatt tccatacgtt tgctgcctag 18300 
atcctggtaa agatccaaca agtaccggaa caatggtgta acgttttgaa cccataacct 18360 
ttgcaataaa tgggagttgc atttcaatac tatgctctga ttcttcatct cggcgatcca 18420 
tcaaatcgaa atgacgagtg gcacgaagct cctcgttaac tgcaaagggc aatgttgtaa 18480 
aagatgtact aagagtgcaa tagattactt ttgtgatcaa cgatcaagtc gccgagtgga 18540 
gttctgtact tgctgcatgt ggttatagca catccattta gagcaacaac gtgagatggg 18600 
ccaagaatga agactctttc actgaaagtt attgagtaag ccctgttgcc aagtacaaat 18660 
ttcaacaact cacactgctg atgaaacaac ttgtttgaaa gcatatgcag ctgtttctcc 18720 
acaatacgaa tatcccgcat gtctgaaagt tatcagaaaa taaatattaa atgcatttag 18780 
agtattacgg tgaaatcaac gctcgagccg ttccaatccg tggaccggcg ttgtcaagcc 18840 
attttgtgag ttgccgatca agatctcgct ggttggcgtt gtaccatgat ccggcatgtg 18900 
aggcagatct cgtgtgctcg ccgaatccgt ttagtgacat tttaaattca gatggtctga 18960
atattaaagt tttgataaat tgttgtatac gacttgatta atatgtttag tagggttttc 19020 
aactactgtg tgtttcccaa atagtcaaca ttgaaaaatg gaaaagtttg aatttaaata 19080 
ttcaaataat tttaattaat taatattaaa attcacaata cagtgtaaca tcacacttaa 19140 
ttcaagatgt tctaaaaata tgagccatcg ggctagctct acttcacgaa ttcgaatcaa 19200
gtccggggaa ctggctcgaa agaaaataaa tttttaattt ggtttatgtc cgaaatagaa 19260
atgggaatct ggtttttcat tctgaataat ttccgagaaa cacttacaaa ataaaattca 19320
gatatcttgc aaaaggaagg ccaaatgtcc tgagaaatag agcacgagag ttttgaaata 19380
cctgcaacaa caggatttgc ttctattttg ttttttgaac tgaattttaa actattatct 19440
attctgaaaa cattttttgt Gcaaaaaaaa tcaagaacaa tttagagcaa aatgtggcaa 19500
tccgaaaatg ttgatgcaac aaaaaagtgt tttttttttc attgaatttc agttttgaaa 19560
actgatttct ttccaaaaaa aaaacgaagg aaaattttga gaaaaaagtg aaaatccaaa 19620
aatgctgatt ttggtttttt tttcaaaaaa aaagcatttt gcaaagtgtg tgcttttttt 19680
cgaaagtttc agaaccttga gacaaaaaac caaaattgtg ttcccgagtg aagcccgcca 19740
cgtggacatg gtcagacgaa tcttgttcgt gttcgcagcc aattttcatt tttgctgaac 19800
gcataattgt tcaaagaaga ttcggtctaa aaagacgaaa ttgaaataga ttgtggaatc 19860
ctttgaaatt ttcttttgac aaaaggtcac cgttattcaa aaattgagat ggtctcgtga 19920
ctaaaattaa acaatcaaga taatcatgat tgtgggcctg ttttaaaata cacttttcaa 19980
aaacgaaatg taggctccaa tccaaactgc gcatcaagac caagaatata aaatttttaa 20040
actcgggaga cgtagagaaa ctttgaatat taaacatcgc cgtcaagttt ccgtcagagc 20100
gcgcctgaaa ttttttagag gcttctttca aaaagctacc catacaaata atcataagaa 20160
aaacgtttta aaactttgca ttccacccaa aaatgtctga aattacccgt aaaaagaatg 20220
tgtgaaggga gtgatttgag ggttctgtca aacagtttga ctgtttcgcg ttcgacgtgt 20280
ctcgacgtgg atggtattga agaggaccgc gctgatcttg tgctggtcgt cgtcgtcttg 20340
tcggaccgcc gcgagtagtc ttcagtctac caattacctg aaaatttgac actttttgtg 20400
atgtgaaact ggctgcctga agcaatgcca tataataatc ataataataa taatgaagag 20460
ggatgaggat gcatgccaaa agaatgaaag gaaagacgct cttctacaac accagccgat 20520
agtatttaga agaaaaagaa gactaaaaag agagtattgg gtgatgggag aaagaacaca 20580
ataggggagg cagtgaaata gaacgagaac aatggaatcg gcagacattt gacactagag 20640
gggccactgt ttcagtcttt ttcgcacttg aatattggaa gagggccaag aaggggagtt 20700
ccaagaatgg aaaaagtggt aggtttgtag aaaatctgcc tttttttttt taaaatttcg 20760
tgttcactac tttatttcgt gttcactcgt ttatgtcttc cattataggc aggcaaagtt 20820
tcatgcctac atacctgcct catgcctatt tgactttcaa tataaaactt gatttttggc 20880
attcttcatt ttataacaat tgtaactaat aataagcttt gcaaagtttt ctgaaagaaa 20940
ttgtctaaat tttcctggta cactgaacat ttttcggtat aaaatctatg cgtatcaagc 21000
ctatttctaa gagccgtaag tattttcagc tgaaaatgta aaccacggag tcaatattta 21060
cttcgtatca tccatcttcc attccgtctt gtttacacct acggcaggta tttagacacg 21120
aatgattgtt tttctcgttg cctaatactt tttcccccga aatattccca tattccagtt 21180
ctgaacaatg cacttttcag cggtcatcgg gtccatccag ccctcattca gccctttcat 21240
ttatcttcgt ttctactttt agacgaaaat gcaaaaaaaa gagaaaaaga cactctcttt 21300
tgacgctcac attcgctcac attgctgtgg tagaaaaaca ctcactcggt ggctgctggg 21360
aagggaaaac gagaaaatgt ttggtcacgc aatacgccta tatctttgat ttgactttga 21420
atctttatac atttttcacg gggttcaaaa acaattatga agaaaattgt ttgattaaat 21480
tagaatgtag attctttata ttttcaatca aaaattaatt trggaaaaat aactatccaa 21540
aaaacgaaaa aagtaataaa tgagtacttg aaagtgaaat ggggcaatta aacaagataa 21600
aaaagactaa aacgtgagac atctcacaac gggtcacggg caagaagtac acgagaaatc 21660
gaacgtgagt ggggaggcag agacactcag ctgactgcct ggcctgacgc tcgctcacaa 21720
aacgctctca ctctcttcct cgctttgccc gctctccgcc ccgggtcgtc agttcggtcg 21780 
atccatgttt gttcattttt ataggtgaaa atttatgtaa gggaacggaa aatgtaaagt 21840 
gatcgtggga aaatagaaaa acaattacat tgtaactttt ctggaccaag ttgtacccag 21900 
atgcaatatg tatatttttc tcagaaaata ctgtgttggg tttcgacagg atcgatttat 21960 
caaaagcaaa cgagtgtgcg tctcaacgag cactaaagtt cccaactaga gcarccttgt 22020 
tgtggtagaa ctacatagaa atttttaatt ttgatttcaa tagcttttct cttgttttct 22080 
caaaatttat tgaaaaactt atttactata aaacgaccaa cgacggatct ggaaactaca 22140 
gtactcctta atgcaaaagg caacgaaaaa tcagccagtg acttattttt tgttctggat 22200 
aaaaatcggg aatatttgca ttttgaattc gcactgtatc gataaacaaa acaccgaaga 22260
tcacgccaaa atgactattg taactaacag gtacgagaaa gggacgcttg ttctacaaaa 22320 
ataattcaac aaattttccc caaaaaaatg tgaagtccgc aattctcgta gttttacgta 22380 
aatcaaaccg agcatgacac tctgacacca cgtgcgcctg aagatgtgcc tgcctaccat 22440 
ggatgcttta catttgctag ttccatgaca ccccatcctt tcagctccca agatgaagga 22500
gttcggagaa aattcgaaaa aatattgaga aaaataaccc aaaacattct gaaacattgc 22560 
ggaaaaaagt tagaaattat gtcgaatata tctgaaccaa tcaacaattt caaataaaat 22620 
acaaaaaaaa attggaagac cttaaatagt ctccgcccat attttggctt caaatgaccg 22680 
tacttcggaa tatggccgat ggccgtggca agacctccaa tcgtagtttt gagcggtcag 22740 
taagtgaaga ttaaaatagg aacagtaccg taagatcagc ccaggtgcgg atgtgggata 22800 
gaggaactga aaataatcga agaagcatga taactaagcc acgtggccac gttcgttttt 22860 
gcgatgttaa tagatcgcca cttcgtccat tgtcgttttg tttgtactaa gtctccttag 22920
caattctctc gaaggcgggc cattgctatt agtaaaataa gctaccaatt ttacctttca 22980
atacattcat tcactgatgg ttttcctatc aggtgatcat ttttttgttc ttctcaatta 23040
cactatctaa aaatgatgaa gtttttgctt cgcggctatt tggttgaagt gatgatatat 23100
ccattgattg tcgtctccac ttgtgctctt tttacgtctt acaacttctt tttaagtgtt 23160
ttgcgtattc actgtttcat ttattttttg cagaaaatga gcctgttcag caaatttttc 23220
ggaggcatga tgcaagaagc tccgattact ccacaagaat ctattcaaaa acttcgggaa 23280
acagaagata ttcttgagaa gaaacaagaa ttcttggaga aaaaaattga cgacgtaagt 23340
tggaagatca gttttggtcg aattaatcac attaaaaagt gctgaaatcg aaatttttaa 23400
actctcgagt ctcaagtgac tgtgacgtaa ttaaaacatt gctcagcatt tacattgttt 23460
actgacgtct tttcgaagtt tagtcgagca aatccaaaaa agagcaataa aaatttctgc 23520
tacgatacgt ttgggaaatt ggaatcatag ttttttaaac tccatttrtc aaaaaataca 23580
ttattagaaa atcagtaagt ttcggaaatt atttgagaaa cgtttcagga aagcaaaatg 23640
ccgtgaagta tggaacaaaa aacaagcgga tggctctcca gtgtttgagt aggaagaaag 23700
ctttcgagaa gcagttgatc catattgacg gagttttggc tactctcgaa catcaggttg 23760
gtatataaaa atattagaga aataaattga ataacacggt ttttcttcca gagagaaacc 23820
ctcgaaaatg cttcaacgaa tgctgaagtt ctcacggtta tgaaacttgc tagcgatgcg 23880
ttgaaagcgg ttcataataa catggatagc gaccaagttc gtgatatgat ggataacata 23940
gatgaacaac gagaagtggc gaaggaaatc gcggatgcta tttcaaaccc tggctttaac 24000
aacgcaattg acgaggccga tttgctgcgc gagttggtgg atcttgaaca ggttcgtcta 24060
taccaccaac atcgtgtaat tattagaaaa tataccagga agcacttgac aaagatttgc 24120
ttgatgcgag agctccccca gtcacgcttc cggatactcc caatattgca cttccagcct 24180
ccagaccgag agctaaagaa gctgacaagg atctagaaga cctcgaaagt tgggcaaact 24240 
aacttctcta agtcactttc atatttaatt ttcggctatt tttgtttcat ttgcatcccc 24300 
ttcatcaatc ctaccattct ccggagattc tcctaaatca actttctaat tacgacaaat 24360 
tcaaatagtt gaatgatttc tgtttagcca tttcantcga aacaaatttc cccaaggcta 24420 
cgatcaacac tcatcaaaat tgtaacatat tatcgagctt tttggaaatt tgtcatttta 24480 
tacatcttgg tccctttctc caaaatcttc caagcatgca ttaaagttcc aacttttatt 24540 
aaaaattcat tctggcaaac atgttatttg taccggttga aaacgaaaac caagcgagaa 24600
atagttacat ctcagatctc cctaacgatg gctcaacccc tttgacgctc atttactaat 24660
gtttatactt ttgctcattt actaatgaat ggctcattta ctaacttgct gagatttttt 24720 
aatttactac tgctaattgt aagatatata tcatttatca tttactatat araaagcgct 24780 
tattccgttt gtccatagtt tgtagtctat gtagtctttg tagtctgtga cgttttggct 24840 
tctggaagga tagtgagttg ggcttagtgt agggatatag ggggtactgr agtggtacaa 24900
tagtggtacg gtaggagtac tgtatgatta cggtagtttc agaaaaatta gttttcagct 24960
ccagaagtcg ggggccgcgc cggaggtgcg gtccacggct ggttttacat aaggtagttc 25020 
caaaaaatgt cctacttcca attactcata actcagttag cgcgctatag ctatagcgtt 25080 
tgagtttaaa aaaattgtgg ccaactgaaa tgctgtttgt cagagatgcg agctctaaaa 25140 
gatgatcgaa atattctatt tctgcggatc tagaatattt cgatcatctt ttggagctga 25200 
catctccgca atcgctaaag ataactaaaa ggtaccaatt aacaaaatgt gttttacaat 25260 
attgccaaca acattttagg tttctttcgc tgattgtttc cttttggttt tggtgatggt 25320 
cccggagtgg tttttttcgc tggttctact attttttgga tcggcaggct ctgaacaatt 25380
ggttgtacaa tcttcttcaa cttcatcaaa ttatccagag ttatgttgtc gcttctgctg 25440
tccaacatat tcatgcattt gacggaactc ttcaactttc tgcattgtca ctggattctt 25500
ctttttcgat tttattttat gaaaacttta ctatcataaa caatagtatt tatcatgtta 25560
caaatcagtt tggaatgatc tccttcattc aaaattctta atgatcagtc gattcactct 25620
tagagccacg aaaaatgtgg gacaattgtt tgagaagtga aaaatagtta ttaatgttgc 25680
aattagttgt acatataagt aatacatgaa aatacatctt aaaaatacag ttactactag 25740
gtattattgc ttaaaattgt gttccaatct gccagtacta tgagcgtaat tcgttgatcc 25800
aatcttcgaa tagccgtgag cacaggcttc gccggcactg cacacaaact tcacgattgc 25860
acgatttgca gaggtagagg acgaacgact ttcctgtaat tggcgaaata ttgttttaag 25920
ataaagttag taggaacgat cgtactgttt ttagaacgag actgtctagc tggtggccgc 25980
atcgagcatt gatggcatcc aagaccttga acttcttcgc tgaatgatat acgatgcttg 26040
aatatggatc cactgaaaat tgaggttata gtagattatt gggagctatt atgatttcac 26100
ccatgaagaa ctgcgtcagt aactcgtttc agattctcgc tatccttttc accgcttttt 26160
cgttgtaatt ctatgagaaa acggtagaat ttggtgacat ttgtcgagtt aaacaattcc 26220
acgaggcaga caaacatctg aaatttgcgt tttttccaca aatgcataaa ctttcaataa 26280
aacaaaccgc ttctagggca acatcagcta aactgtgatc atgctcgtat tcggcgttta 26340
gcgagaagca taaatggtag aataaatgaa agatatcggt aggttcgcgg gaatccggat 26400
tgtagtcttt gagataatca acgcaatttt gtttcagatt cgtcatcagg tatttgtcgc 26460
atagcctgag aactgtgcac acgttttgtt ctgaaaataa atttggcatt cattgaaact 26520
acatcgatca tgaactacca tcaataacat ccggatataa accaagagaa ttgggagaaa 26580
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tgacagtgat caacttgaga atatcttccg gtgactcatc aaggatattc agctgcttta 26640 
tggcgccttc aaggaaaaac ttgttctcca tcaagatgcg gaaataatcc gaatttcttg 26700 
caaagattgc tggatccaca aagtactttt gattaccaac aataataggc cagtttcgaa 26760
gcttgcttgt cgactcaaaa tcgacctgaa agaaaaatcg aaaaattcca atttaaaaaa 26820 
cgtttgttta cgtaatcgga tccttctagg aaggtttcat gacttgttgt cggctgcatt 26880
agaatgacgt ttacggggaa atcattattt attccgaaac gtgcttgggc ttgttctgtt 26940 
tgctgaaatt ttgaaaggtt ctccgaatat taagcgaaaa aaacttacat taataatata 27000 
aggtctcata gcgccgagta gctaaacaat taatatttga ttacaagttt ggaaagatct 27060 
ttctgagctc gatcaggaag aaaaacttct tgaaacttta gaagatgaaa tgtgtgctac 27120 
cgtataaact ttaaaggtgc atgaataaat ttctcctttt ggtcctgcga cgattaaact 27180 
ttttaatcaa ttctctgggc tagtttttat tcaataacta gaaatgttgt ttatttttgt 27240
tccctactta aatcatatgt tattttcttt ttcctttgtg tcttacaggc ttttttagct 27300 
gaagaaatag caattttccg ataaaatttg ttgctctatg ttaaaggcgc atgcatttat 27360 
ttgagagacg ggtctcgcaa cgtgctcact cctcggcccg atttgttctt cgtttgcgcg 27420 
gttttcaggc ctttaaaaga tagttccgtc gtttttttct caatttctgc tgaaataagg 27480 
tttaattaaa tttattttca aaatcttggt aaacarttaa actcatatat tcagaatttt 27540 
cattcctctt tcacccagaa aaccgaattt caatattaag attaagaaca catctagaac 27600 
atgcaaaaaa cacaattgct atctctctac tttcatttta aggctgattt tttgaagaaa 27660
aatcatgaaa tacgtccatt attgttgtat cccttgtttg catccaaagt tgactcgatt 27720 
gatctcttaa atgtggtatt ccgttcgaaa ttcgattgat ttttagaagt taacacattc 27780 
ggaatgatga taattcgtat caaaccaaaa ttgtcttctt ttcgcctttt ttgtgcagtg 27840
tcagcattaa acaaaacgag aatattgaaa gttacgtggc gtttgcatct ctcaccacga 27900
tgacatcacg aaatgcagac gacaaagacc ggtgaaaaat agtgcgctga atggtgaaaa 27960
cttgcgaaga taacgtgtta cgggttgaga gagaaaacat tccgcgagac aatgcttttg 28020
gtgagaggcg cagatggttc agagaacact agagaaaacc gcgcctctgt ccgctcacag 28080
ccagccccat caagcctctt cgggcatcga cgcatagaca cacatcattt tgccccaatt 28140
tcctttcatt ccgtcaagta tttcgcaact aatcgttatt gctcattaca atacacattt 28200
tacagaagtt cctcttcttc tacttggtcc gaccgcatca gataactggg aga-cccagtt 28260
gtgcatgttc ttgtgcccac acaaactcgc gcccatttac aattttatga tcgacaaccc 28320
tcaagaaggt aagcatttaa acgtgttggc cgtgcgtctc aaaaaattgt taaaaaacct 28380
ggcgacacgc gtttttccac aatttcattc cctagggcat tttgtatttg aagtaattct 28440
attacgcgta cgcaatcgga cgaatcctgc aggtttgttg gtagtcaatt ttatcaagtc 28500
gactgcctct tatgctttct gaaaaaagag aatgacagtt ttcgctaagt agtactaaag 28560
cgatctttta tctttggcaa aaccttgata taagcattat cacagcatat catgcagatt 28620
gatttagagt taagcatgaa atgtgcaagg ctaaaataaa ttacaaaata agtccatagt 28680
ccattttagt aacagtatac atcagctgat agaatcacat gcgtaatgac aggtctaaaa 28740
cattatcaaa caaaagacat tacaaaaaca agaaaaatac aatataatag aacgactatt 28800
tgaaatgagc gtagttaaat tcggaacttc aatagattat catacgcgct tttaaaaaaa 28860
tgtgtgttcc cttttctccg cgtttgcccg ctacaaaccg gtgagtcgga aggcataatc 28920
gggttgaaaa aaaagtatca aacactgatg gtgtcttttt tagggaggtt gtccagaaag 28980
agaaagaaac tgaagatttg cgaatcgata gcgtcgtcat ctctcgacgc cagtgaagtc 29040
aagatcggtt acaatagtgt atgcgat-ccc caaaatccac atatcaaccg gactcgtgat 29100
atttatcatt tgtaagtact aacaagagat gtgaacgtat ttacactcaa cattagcaaa 29160
ttccagaaga agatctaaac aaaaactatc gaaatggctc tcaacgtgaa ccgcgctgtc 29220
gctgatccat tctaccgcta caagatgccc aagctgtcag caaaagtcga aggcaaagga 29280
aacggaatca aaacggtcat ttccaacatg tctgagatcg cgaaagctct cgagcgtccg 29340
ccgatgtgta tgtttatcgc cagttggctc gccattggac acaaaaataa ccattgtttt 29400
tcagacccca cgaagtactt tggctgtgag ctcggggctc aaacgaactt cgatgccaag 29460
aacgagcgtt acattgtcaa cggcgagcat gatgccaaca agctccagga tattttagat 29520
ggtttcatta aaaagtttgt gctttgcaaa tcatgtgaaa acccggaaac tcagttggta 29580
cgagatcatt gaattaataa tctgtctaat tttattattt cagtttgtcc gtaaaaataa 29640
catcaagagc aagtgcaagg catgtggatg ttcgttcgac attgatctca aacataagct 29700
gtctacattc atcatgaaga atcctccaaa gattgatgtc gatttttgta agtatcgttt 29760
actaacattt ttcgattgaa cttatgcaaa attctgccaa aaattctatt tgcattttaa 29820
atcctttcaa ttcgattttc cgtgtgcttc cagtgcatac aaacatgcta atttttggtt 29880
tccagccaaa gccgaacaaa agaatggaaa gaagacatcg ggtgctgacg ccgccgccgc 29940
cgtggctgcc gacataarcc acaacagcga caaaggcagt tcgaatgatg acgacgacga 30000
cgattgggaa cctgaaccag tcgagccgaa tggcatgctg tcggcgggaa tgggcaagct 30060
cgtgctggac aaggatcttg agaagagcga agaacagcgt ctcgacatgc ttcacacatt 30120
cttcttgaaa gccaaggaag aaggtaagaa ttctgagcat tgataaaaag tattctcgtt 30180
atttcagata gaatttctga tgccaaggga caaactgctc tacgtgacga agctgagaga 30240
cttgagctga agcaaaaagc atctctcctt ctcgcgaacg tttttcttga tgagaaagta 30300
atcactgaca aacaaatcag caaacaccgc aatcttctgc ttcgcttcac gttgaatgac 30360
aagaaagctc aaagatacct gttgggagga gttgagcaag taattcacaa acatgaagcg 30420
gaacttctgt ctaaatcagc tcacatcatt aagtcattgt atgatgaaga tgtctgcgaa 30480
gaggattcgc ttatttcatg gggagagaag gttagtacca aatggagctt tgtttcgaat 30540
taaagtttat atttacagcc gtcgagtaag tatgtctcca aatcttttgc caagaagatt 30600
attgagaact ctcaaccagt gctcaactgg ctgaaagaag cggaagaaga aaccgaagaa 30660
gagtccgacg atgagattgc ggtaagaaat atcagatttg tttttttttt ttcaatggtt 30720
ggttttcagt tcggaggaga cgtcaaggag agtgaattcc ttcgtcaaca gaaggagaag 30780
gctgctagag aagctcagca aaaatcagcc aaggctacaa acggcaatgc tgctgctgca 30840
tccggagcaa atgatgaaga ggacttggat attgatgaca tttaattgta cagatgcttt 30900
tttaaaattt acctgggcta cttatgtttt ttgtgtattt cttcccatat tcgaaccaat 30960
tcaactaatt tcgaagaagc ctcagttttt ttttgctttc tccccctttc aatagtaagc 31020
atcatttcat ttctgtcttc tgtcttttct gttcctacgc tgttttccct tcaccaaatc 31080
caattcattt attcgtaaag tcattactat ttgttgttaa tcgtaaacat ttgggaatat 31140
tcttgttcaa ttcagtctta tattacaaaa acacaatgtt caaaaaaaaa gaatcacttc 31200
agatgggaac ccgtcgaatt cggcggtccg atggagaata cacattgttt tttcggaaag 31260
ttagcccatt ttcaaatcat cacccagctg atttcatttg cgacgaagcg ataaattgta 31320
aagagccgaa aaccttttgc tgctcggaac agtactatat gtacaataag gcttcactat 31380
tgatggattc aaaactgatg gcagcgattc tagaagcaac ttgtccgaaa acaatgaaga 31440
caatgtgttc taaatggtcg ttgaaaggat ggaaggattc ggtgtaagtt ttaaatcagt 31500
ttgataataa aatatgtttt tcttttacag atgggatgag aacaaagaag aagrgatgag 31560
aataggatgc ttggcaaaat tccgtgcttc tcgccatctt cgttatgctc tttttctcac 31620
aactggtagc aaactagtcg aatgtagtcc gttcgataaa atatggggaa tcggttagtt 31680
tccaacggat cgtcttattc ttccatcgcc catcacaatg caatcagaat cttcaaactg 31740
gaaatgtttt gaaatcattg aaatcatctt tgagctgata tggtgacgga agaaaaggac 31800
gtctgaaaat ggctgaatta ttataggaaa agatatgcaa gccgcacaat gggctccatt 31860
gagctctggc aagaatctgc tgggaaagat tttggatgga atccgagagg aattgtggga 31920
tgattcaaat tacaagttag ctctggaatc agaaaattat tattatataa aattactatt 31980
tcagagatga acgagaagaa gtggagaaac gaatggaaac tgaaagagat tatctattca 32040
ctgctataga gcacatggac ttgatgtaca aagaaagagc aacaaaaaga gtattgtaag 32100
aatcagaaaa tctgcgtaat tgtcgacaga aataacgtat tccagattgt tcgaggaaga 32160
attgttaact gatgatagat cctacatcac accagatatt cagaggctcc ttcccgactg 32220
ggcttggccg ccgatcctcg tgaaaaacga gcctattcaa ccatcgctgc ctgtaataat 32280
cgatttccct aggtacttgc cttgatcttt aatttatcag aattaacttt caaattccag 32340
atcatctcca cttcgagcag ctgaaatatc acgtaggaag agcacatctc attcgacaag 32400
cttgagtaaa aggcggtacc tcaggagcag atcgagaagt ctgtccaaaa gcccggctcg 32460
aagacgctcc agacatcttt cccgaagtgg atcccgtaca ccagctcaac ggcattccag 32520
aagatccgaa agtacatctc gaagacgttc cggacggcac tctagaagtc gatctagaag 32580
cccaccacga aaacgtccgg tacgccgatc aagaagcaga tccaggagca ggacaccaaa 32640
ccgaaattgg acaagagcac ggagcagaac aagaagtcag gctaaaagta gcagcacttt 32700
aacctggcca ctgagcccat cgagaagcag aagtaacagt aatgaaagga atttgaaaga 32760
gaagaaagac cggaaaaaga aaaaatctga gaagaaacgg aagcatcatt ctaaatccag 32820
aaaacaccgt tctaaaagat ccgaatccag agaagaacgt cacagaagac ggaaggagaa 32880
gaaaagagag aaaaagaaga aacgacgtcg gagaagttcc actacttcag attaaacttt 32940
atttttgaaa actagtcata actttaaaag tcataacttt tttaaaagtc ataacactgg 33000
tttaatatca aatgtctttt caaatattct ctatttattt attcttcgta attaaactga 33060
gattaagtac tgggtatatc attaataaaa ttacgatact ttgccgaata aatcagttat 33120
aattacaatc tgtctgctgg tgaaaattgt acatgctatt ttcttgttcc tcattctttt 33180
ttcattctct gtaaggtttt gttcgttttt tggaaaattc tgagagtagc cggaaaaaaa 33240
aaaaaaaaaa actaaatacc tacagtaatg ccagaggcat atgctcaata attatcaaaa 33300
attagttttc cgcggcgaga cccatcccca caaaagtatg actcccttga aagtcgtaaa 33360
tgacaatttc ttgaaacaag aacatttgta tattaacgaa acacaaaatt ccgagaatgc 33420
gtattgagca gcatatttgc cgagccaaat atctcgtagc gaaaactaca ttaattctta 33480
aaaacactac Tigtagcgctt gtgtcgattt acgggctctt tgaattatca ttgatttatc 33540
gatagaatat ttaaaaaata aattcatttc gaaattagag gccataaatc gacacaaaca 33600
ctacagtagc catttaaaga attactgtag ttttcgctat gagatatttt gcgcatcaaa 33660
tatgttgcgc aatacgcatt ctcagaattg tgtcttccgt aataatagac agtggcttcg 33720
ctaaaaacta agaacaaagt aaattaaagt ttttttctgt tcacttcaaa ttttacacga 33780
tcttgaagca aagttcaaaa gagcatgaat caattggaaa gtgttcaatg caccctacag 33840
atatgatttc ggggcagtgt aaactacagg gcacagacat aaaaatttaa attgttgaag 33900
actaaaatat aaacatatga attcaagggt cataataaat gtattttttt aaataatatt 33960
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tattaaatgt atgcatacaa ttaaatacaa cataattatc aaatacaaat attataattg 34020 
caacctgtcg gacaacaact ttgctgaggt gtcgtgtgac agtcagaatc cttgtcacac 34080
cagctgaccg gctcagagac gatacatcgg aagttgagat gagtgactgg tggacattgc 34140 
cgacgcgttg gagcacaaca ctcacgatat cgagtcatgt cgatgcagcg ctgaaactca 34200 
ggaaactatg tggaatttag gtggatcacc caaccagctg cccttcaccg cactgataat 34260
ttggagtgca gtacatgtaa tgggcagagc attgctgcat ttgcatcaca atcaatgaat 34320 
ttgcaaaggg cctggagatt ggcttggctg aaagagttga tattatttct attgatataa 34380 
taccctaaat ttacgaaaat tatgctaaat taggatttta gttataatcc tcgtcacatc 34440 
tgatctctga aaacttaaaa atatcctttt tggtagtgtg gcaccaaatt cgtgctgtaa 34500 
cagagaccaa aaacactact ttttcgacat ttcctctcct tgcagcgaaa aataaaattt 34560 
tttgaaaatc tgtgttttct catacccgga aaaaaccaac aaaaacggcc ttgttccaaa 34620 
ggcggtgagt atttctattt tatgaaagtg gccgagattt ctctttttct acgccaagta 34680 
gttaattctt cgcggcaaga cccatcaatt ttctaacctc taatctcttt ttcaacatga 34740 
atatccacgt catcatagaa tttgcactcg ggcttataga tttggagcct ttgaaagtat 34800 
atgcaccagt ctatatgggt gttgggaaac gaataggcag tagttttttg gaccaattgt 34860
agaatagaca gtagtaatag ggaagaatat aagaatttca taattcagat ttcaataaaa 34920 
aataaattta attgagaaaa aaaacggttg atattctttt gtttaagcag acaagtatgc 34980 
ggaagtgaat cttgagcacc tcgtaaatca cgggaggcgt acttgtacag aagagagata 35040 
agggattaag aggcgcaagc tttgccactt tgaagttaaa aaataaagaa agagacatgc 35100 
aaattggtgg acaaatagcg gaaggttagc gggaggtggg aggggggaca ggtgcatgta 35160 
acacaatgga ttttacaata ggaatattga aaatacgcat atgggaaatc ggaacagata 35220
tgaaggtgtc aatatttgag gtcaactgtc tggtttttcc ccgatttttg aattttttga 35280
aaaaaagtgc ataattcaca gattgaaatt ggaaattggt cgagaaaaga ataaggagtg 35340
ttatgaattg atggtggcaa caaaacacaa attctacatt tgtaccaaaa tgcccactaa 35400
aatgggcata ttcgcacaca ttccacacaa attgcataca tattccacaa tggggaatat 35460
tttgaatatt tagattaata aagatgaaat aattgagttt tatttgtaat taaaatattt 35520
ttctgtttat cattaattga aaatgttgaa ttacttttta atagacgaat catcaaagaa 35580
cttgatccct gcattatcag gcaatcctac ataacctttc aacgttgtcg ttttaccaat 35640
tgcaacattt ctcgctactg gaacacgcat actggaatac gatgacgatt ccaattggaa 35700
gaatatattg gtgcccggtt ggaagttaac aattgaattg ttgttaagcg ataaaggata 35760
cacattgata acatccaaaa gttcagttat gtatatccat ccgtataaat cttgcgatct 35820
tccattcacc aaaagctggt cgccatcttg tataggaatg aatggagtta aggatcccgt 35880
aacagtacga gttgtgagcg tagttccact gaaaattact aaatatttag ttcaaaggtt 35940
ttctgttact actttttggt tgcaacaact ctgagaaatt ttagttttca ccaaaatttt 36000
tcgattttgt acagaattgc acaatatatt ttggaatagc aagaaattgt tcagtgaatg 36060
tcaaatctga caaaaaaaaa tttttttaaa aggtgcctat caatttttaa aaatgttcta 36120
atattttgtt ggaaagtttc aataatttca ctacatttac tatttctttt ttaggcctat 36180
tttgggtatt caaaatatta accacacgac cttcaataca ggaaaactgt caaatttttt 36240
ttaaattatg aacaattaac tcactttaca ttttgtcctc cattccttgt agttaatata 36300
agacttccca acgcttcttg agaactattc gaaataatat aaatcttcga atttcttcct 36360
actatatatc ctagtgtgtt gctcgttgca acgtctagag tatccaatat aaacccacta 36420
gaagctgata taaagaaaaa taatagaaat atatttttca ttttttccaa atgactaaat 36480
gaccaacttc aagacatttt atatgcttaa aatcacgtca cagaactata atcatgttga 36540
tttttgatag aaaatgataa gaaatgcgac caaaatgtgt attttctccg tttgtcctct 36600
gaatgagtca aattcacgta aaacttggca tttgtcacag tgtgtcagac acaaggcaca 36660
tatgtattta ccggactttt caagacttta ttattattga gatcaaacca gatracagaa 36720
gacgggagaa aggtaccaac aaatatcaga atattgcaaa aaaaaattaa aaatttcaaa 36780
acgcaaactt caaactagga gagctaattc aaactttgaa atcatgttcc ataaccggta 36840
gcatttgttc ggtgacttgt ttgacagccc attgaaggaa gagaagtact cccgacaggc 36900
tgaaacatat gaaatagtcc aggccttcca ttagagaatg tgatgtttga aggaagaaca 36960
atgggacgta gagtactccg aatagagcag taagtccatt gatgagctga aacagtaaat 37020
aatcgaaaag ttagtaaata tgttcaagga atggaagtaa accggaatra tccgagtatg 37080
ggcgttttat agttttttct ctttttttga cttcgttttt catcctatta aaatatcatc 37140
ggttttttcg agttccagaa aaaatattta aaaaatcatc cgaaatccga acacaaaatc 37200
cgaaggctac tccaaggtaa gttaacccta ctcggcaaat ctctcgtcct ggagcgcgga 37260
cggggcgcga ctagatcacg ggttcgcgct ccagtcaccc tttttttcgc gcttcttacg 37320
cgccacgtcc gcgcttcagg aggagcgatt tgcggagtac ctttbatgca ttcagactgg 37380
tacttaaaaa ttaatcgatt tttttaaaaa gtgtcataaa ctttttctac gtctttttct 37440
gacacaatgt tgaaccgtac tagattgttg taaacacggt cttcaaattt gattttcgcg 37500
aaaaaatttg aataattttt ttctaacttt tttcttttta aaatcttacc acacttagca 37560
aataaccatg aagcacaact tcataagtgg atcctatttt tcgtttgaag aggcaaaata 37620
ctgaaaacaa aagagctgat atggagcaag acacgtggat ccagaagagt atacgcacaa 37680
tcacactatc cccttcgatt ttgacgcggt acagaattct ggaatttttt tttgaacttt 37740
aatggattgc gattcaaaag aaaacgtagc ttaatctcca gttaaagctg attttcattg 37800
caaaatgtat ttagaaaaaa ctcacgctaa taaggcggag agtattgtct gtagaaccgc 37860
catgattact gtagatgcat agagtgagaa tgagcacata taagcgctcg gctgtttttg 37920
aacgacaatc gaattggccg ccatcatctc attcttcgac ctcccgtttt atttctgaaa 37980
atatatgaca ctttttaaat gaattgacag aaatctgatg ctaactacat tttaacttgt 38040
aggagtggtt caaatgattc ataaagggaa tacaatttct gaatgatcaa agaagaaaga 38100
aaaaaaatat tggtgaatgt ataatttttt aggggtaaag taaataaata aacacaaggc 38160
cgaagattag caagagtttg gggataaccc ccgtgaagaa aaatatgaaa aaaaatggtt 38220
tgaaagaatt aaaaaaatcc tttcaaattt gagattcaaa ttttgttcat ctgttctgtt 38280
cgaacattga gcagaagaag cttttaccaa taaatccaaa atttgttaag agaatatagt 38340
ttaaggatat cacccagttc aaaatagtag ttcaaaaact cgagtcttaa ttttttcagt 38400
attcgaattt ttacagtaca ttgatcgttt cgttatttga tcgctttttg ataaaacaaa 38460
aaatagataa tgaagctgcc aagtttaaaa aaatcggggc taaggctaat ggagcataca 38520
cggtatatca ctacctggat attagtttta gacttcatca gatatttagt cagaaaagta 38580
cgtcaagaag tcggatacga aatgtataaa tttcttaaaa cttaaaactt cgagatatcc 38640
agactgtggc tctcaagctt cagtgcttgg agaaatagtt taatagtcag aatatgtttt 38700
aaatttctta atttttctga agaagtcgta aaagtataaa tgttgctaga tcaaacactc 38760
tagaaaacct tcaccacttg agaatactcc agtctcaaat tttccctcga cgcggaagtg 38820
tagaagggcg cgagattcag aagtaggtga aaattagacg gaaaactctc tcaaaattga 38880
aatcaatgaa taggacaact gagacaatgt gcaggtgtat gtgtatgcac atggcaccca 38940
cgtacacgca tacatcttat gttagagaag tacgtgtgct ccgctcatca tgtcttctcc 39000
ttctcctaca tctacatttt ttgctccgtg agccacgccg ggaaaaacga cgacgacgac 39060
ggcgacgggg gacgactact cgactctaat tggccctaaa cgcaagtaaa tttttaggca 39120
atgtatgttt gcgagagttg agagccccac cgccacgagg agaagtgggg gaagattccg 39180
aagagattcc ccctcctcct tctgatcacc tcgtctttcc ttttttgttc catttccgtg 39240
aaaaagctgt ggaagggagg agaagaactt accggctaaa tggaaaaaaa ggaactctaa 39300
cttattctga ctctacggaa ataggaagcc tacttgtcaa ttagaccgcc ctcgcacaga 39360
tttctttttt tttgtagata caaatataaa aactaactgc gtgtgatgca gcagatatct 39420
tgaattggaa agtgtcagtg ctcagaggga atagccaatc attgacagaa atttgactac 39480
ttcagaagga atcaactaga acatttgacg cctgaaacct aacaagaaaa atctataatt 39540
tggagatccc tagattgatg ccaactttat taaaaactaa gtatacttat atatatacga 39600
tttttttaaa aataaacctg attgtctgaa tttctacaag attgcgacca aattttccgt 39660
atttccaaaa tctaatatta ggggtttcta ctaaaattca acgagaactc ttaacattat 39720
ggttatttta acacatggtt caccgccggc tcaaacttca ttcttagtcc tctgattttt 39780
ggtaaatcga cgcctacgtc tcaacaatta gtttgtgcag aaaataagta aaaagagttg 39840
tgctccatct tgcacacata cacatcgcct gtaatgaaga ggttcggagt cagatgacta 39900
ggcgtagaaa tgtgcgaaat tcacggataa cagagatttt tgatgtttca tcagacttac 39960
acgttttgga agtatgaatt gggtctagac aacggagtgg cagatgttcg gaaaattttg 40020
cagaaaagag aacctaagag cgttgatggt ttggtgacta acgaacttaa aagaaaattg 40080
gtcattgaaa attttaaaat tttaaatttt gcttgcagtt catctttctc tattaacaaa 40140
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aattattttg tagcrtttct caatttcagg caattaaaac atttcaattt attcttctat 40200 
tatggaagtt tatctctaat tgaaactctc caattttgat caaagaacaa acgttctcgt 40260 
tgtttgaaaa aaaaaacagt tcttttttga aactcgcgcg caaattatta accaatcatc 40320 
ctcgtttgcg cgcaaaattg tagaaaaaat catttaaatt tatcaaaaat agtttaccat 40380 
tctgatgagt ttttcatata caaaaatgcc ctggcaattg ttgttttctc tgaaatagca 40440 
cataataatt gaactctacc cacataaagt tcgttctgaa aaacacctta caattattgt 40500 
gattgagagc caccccaaga gggattagaa aaacggatgt aatctgtata ccttcgagat 40560 
tcgtttattt ccttgtataa ccaatagcag gaaaattaca gctttttcta agtaagcggt 40620 
gaaactagag agattctata gaatatgggc gttaataatt gtatgttaaa gttttagaat 40680 
aacacaagtc cagagtaagg gcaagaaaag taatgagcaa cggaaaccag catgcaagac 40740 
acccgaattc cggttctctt ctgaaactaa aagttgcgtg tactaaacct taaaccagca 40800
gctggctagt ctcaagaaat aatagaaaaa aggaaggaat gaagatatgg gaataataca 40860
aattgaaaat gttgtgtgag ctccgaataa ttttcaatat caaaaattta tgaattgtgt 40920 
ggacggctgt gtgtgcgtgt gcgtatgcgt cggcaagaaa aagaagcgac cgaataagaa 40980
aatggttgat tcagtgaaca aaaaaagaga gaaagatatc caaacaaaat tattcaaaac 41040
tattatcaat cggtaggtat tgctctagag cacacctttc tggacactca gcagacatgc 41100 
gtagagaggg attatgtggt acatatagtg gatggaggaa cagatattta taaatactta 41160 
tggaaaagag gatgaagata ggatgaggta gatgaattga gaagatttta aaatgataat 41220 
ggatattgaa tttgaataag gagattctaa attatccgaa gaacacaaac tatatcaaga 41280 
ctacaaaata atctagacga gtcccagttt tgcaaggtaa ggattaatct taaaaggatc 41340 
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ttttaaatat ttatttcaat gctcctataa attttaaaaa gtaggtgcat tctaatatgt 41400 
acagtgatta ggagatatgt gacgttacgt gaggtctcga taaagtacgg tattcgagct 41460
aaatttcaaa cattgtcaag gtagattcgg tacacagcca ccataaatgt tccactaaaa 41520 
atgtgttgtc cttctccttt ggaacacaaa tctagctgct gaactttttc acttcactac 41580 
atgtcaatgg gattgatatg catctaggac atttttttgg ttatcaatag tccgcatagc 41640 
ttgcgtaacc aatacaaccg attgtccaaa aaaatttgaa cactacaaaa cgtatttatt 41700 
attcggatac ccgttgcatt tcaatacaca agttgatact tgctgcccct cggggctctc 41760 
agacactcat tgactgaaaa cagacgattg ctcgtcgtcg tagtctgaag gctcggagag 41820 
ctgaggaaga tatgaggaca taatgaattg atgtgtgaga atgagaaaat gaaaaaggaa 41880 
aaatgagaaa aaaaagatga tgaagaatgt acaaatgaat aatcaagtag caatgacgag 41940 
aaaagaacca ggtccttttg gcaggcaatt ttcgaaattt tcagatcaaa tttgtcgcca 42000 
ttgcttctgg attaataatg gatgacgctt tgacaatggt gctcaataca agtgcaaaca 42060 
gattggtttg ggatggcgta tagaaataga gccggtgaga cgatgtgatg aagttctgag 42120 
agacgagatg tgatcgaggc gtttgtagtc gaggcaaacc gaggccgcat atggggttcc 42180 
gataggcaat cggagaccag tgtccatctg aaagagataa aagttattcg agttgtgaat 42240 
gttgcaagga aaattaaagg tacagtagag acaatcgaga cttttttcgg gaggacgcca 42300 
tctaaaaact gtggaagcac gtggctttgg tagcttgatg tcacagaagt tgattccata 42360 
agaattacat tagaaagctt gcgacgctaa atggataaat ctggtaacgg cttcctaata 42420 
gcaagttaag ttttttcaca ataaattttt cagaattgaa tagatgcatt ttataactta 42480 
cacatcgagt gggcacgttg gtggacaaga caagccccga t 42521 
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<210> 24 
<211> 4434 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 24 

gccgccctcg ccaccgctcc cggccgccgc gctccggtac acacaggatc cctgctgggc 60 
accaacagct ccaccatggg gctggcctgg ggactaggcg tcctgttcct gatgcatgtg 120 
tgtggcacca accgcattcc agagtctggc ggagacaaca gcgtgtttga catctttgaa 180 
ctcaccgggg ccgcccgcaa ggggtctggg cgccgactgg tgaagggccc cgacccttcc 240
agcccagctt tccgcatcga ggatgccaac ctgatccccc ctgtgcctga tgacaagttc 300 
caagacctgg tggatgctgt gcggacagaa aagggtttcc tccttctggc atccctgagg 360 
cagatgaaga agacccgggg cacgctgctg gccctggagc ggaaagacca ctctggccag 420 
gtcttcagcg tggtgtccaa tggcaaggcg ggcaccctgg acctcagcct gaccgtccaa 480 
ggaaagcagc acgtggtgtc tgtggaagaa gctctcctgg caaccggcca gtggaagagc 540
atcaccctgt ttgtgcagga agacagggcc cagctgtaca tcgactgtga aaagatggag 600 
aatgctgagt tggacgtccc catccaaagc gtcttcacca gagacctggc cagcatcgcc 660 
agactccgca tcgcaaaggg gggcgtcaat gacaatttcc agggggtgct gcagaatgtg 720 
aggtttgtct ttggaaccac accagaagac atcctcagga acaaaggctg ctccagctct 780 
accagtgtcc tcctcaccct tgacaacaac gtggtgaatg gttccagccc tgccatccgc 840 
actaactaca ttggccacaa gacaaaggac ttgcaagcca tctgcggcat ctcctgtgat 900 
gagctgtcca gcatggtcct ggaactcagg ggcctgcgca ccattgtgac cacgctgcag 960 
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gacagcatcc gcaaagtgac tgaagagaac aaagagttgg ccaatgagct gaggcggcct 1020 
cccctatgct atcacaacgg agttcagtac agaaataacg aggaatggac tgttgatagc 1080 
tgcactgagt gtcactgtca gaactcagtt accatctgca aaaaggtgtc ctgccccatc 1140 
atgccctgct ccaatgccac agttcctgat ggagaatgct gtcctcgctg ttggcccagc 1200 
gactctgcgg acgatggctg gtctccatgg tccgagtgga cctcctgttc tacgagctgt 1260
ggcaatggaa ttcagcagcg cggccgctcc tgcgatagcc tcaacaaccg atgtgagggc 1320 
tcctcggtcc agacacggac ctgccacatt caggagtgtg acaagagatt taaacaggat 1380 
ggtggctgga gccactggtc cccgtggtca tcttgttctg tgacatgtgg tgatggtgtg 1440
atcacaagga tccggctctg caactctccc agcccccaga tgaacgggaa accctgtgaa 1500 
ggcgaagcgc gggagaccaa agcctgcaag aaagacgcct gccccatcaa tggaggctgg 1560 
ggtccttggt caccatggga catctgttct gtcacctgtg gaggaggggt acagaaacgt 1620 
agtcgtctct gcaacaaccc cacaccccag tttggaggca aggactgcgt tggtgatgta 1680 
acagaaaacc agatctgcaa caagcaggac tgtccaattg atggatgcct gtccaatccc 1740 
tgctttgccg gcgtgaagtg tactagctac cctgatggca gctggaaatg tggtgcttgt 1800 
ccccctggtt acagtggaaa tggcatccag tgcacagatg ttgatgagtg caaagaagtg 1860
cctgatgcct gcttcaacca caatggagag caccggtgtg agaacacgga ccccggctac 1920 
aactgcctgc cctgcccccc acgcttcacc ggctcacagc ccttcggcca gggtgrcgaa 1980 
catgccacgg ccaacaaaca ggtgtgcaag ccccgtaacc cctgcacgga tgggacccac 2040
gactgcaaca agaacgccaa gtgcaactac ctgggccact atagcgaccc catgtaccgc 2100 
tgcgagtgca agcctggcta cgctggcaat ggcatcatct gcggggagga cacagacctg 2160 
gatggctggc ccaatgagaa cctggtgtgc gtggccaatg cgacttacca ctgcaaaaag 2220 
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gataattgcc ccaaccttcc caactcaggg caggaagact atgacaagga tggaattggt 2280 
gatgcctgtg atgatgacga tgacaatgat aaaattccag atgacaggga caactgtcca 2340
ttccattaca acccagctca gtatgactat gacagagatg atgtgggaga ccgctgtgac 2400 
aactgtccct acaaccacaa cccagatcag gcagacacag acaacaatgg ggaaggagac 2460
gcctgtgctg cagacattga tggagacggt atcctcaatg aacgggacaa ctgccagtac 2520 
gtctacaatg tggaccagag agacactgat atggatgggg ttggagatca gtgtgacaat 2580 
tgccccttgg aacacaatcc ggatcagctg gactctgact cagaccgcat tggagatacc 2640
tgtgacaaca atcaggatat tgatgaagat ggccaccaga acaatctgga caactgtccc 2700 
tatgtgccca atgccaacca ggctgaccat gacaaagatg gcaagggaga tgcctgtgac 2760
cacgatgatg acaacgatgg cattcctgat gacaaggaca actgcagact cgtgcccaat 2820 
cccgaccaga aggactctga cggcgatggt cgaggtgatg cctgcaaaga tgattttgac 2880 
catgacagtg tgccagacat cgatgacatc tgtcctgaga atgttgacat cagtgagacc 2940 
gatttccgcc gattccagat gattcctctg gaccccaaag ggacatccca aaatgaccct 3000 
aactgggttg tacgccatca gggtaaagaa ctcgtccaga ctgtcaactg tgatcctgga 3060 
ctcgctgtag gttatgatga gtttaatgct gtggacttca gtggcacctt cttcatcaac 3120 
accgaaaggg acgatgacta tgctggattt gtctttggct accagtccag cagccgcttt 3180 
tatgttgtga tgtggaagca agtcacccag tcctactggg acaccaaccc cacgagggct 3240 
cagggatact cgggcctttc tgtgaaagtt gtaaactcca ccacagggcc tggcgagcac 3300 
ctgcggaacg ccctgtggca cacaggaaac acccctggcc aggtgcgcac cctgtggcat 3360 
gaccctcgtc acataggctg gaaagatttc accgcctaca gatggcgtct cagccacagg 3420 



ccaaagacgg gtttcattag agtggtgatg tatgaaggga agaaaatcat ggctgactca 3480
ggacccatct atgataaaac ctatgctggt ggtagactag ggttgtttgt cttctctcaa 3540
gaaatggtgt tcttctctga cctgaaatac gaatgtagag atccctaatc atcaaattgt 3600
tgattgaaag actgatcata aaccaatgct ggtattgcac cttctggaac tatgggcttg 3660
agaaaacccc caggatcact tctccttggc ttccttcttt tctgtgcrtg catcagtgtg 3720
gactcctaga acgtgcgacc tgcctcaaga aaatgcagtt ttcaaaaaca gactcagcat 3780
tcagcctcca atgaataaga catcttccaa gcatataaac aattgctttg gtttcctttt 3840
gaaaaagcat ctacttgctt cagttgggaa ggtgcccatt ccactctgcc tttgtcacag 3900
agcagggtgc tattgtgagg ccatctctga gcagtggact caaaagcatt ttcaggcatg 3960
tcagagaagg gaggactcac tagaattagc aaacaaaacc accctgacat cctccttcag 4020
gaacacgggg agcagaggcc aaagcactaa ggggagggcg catacccgag acgattgtat 4080
gaagaaaata tggaggaact gttacatgtt cggtactaag tcattttcag gggattgaaa 4140
gactattgct ggatttcatg atgctgactg gcgttagctg attaacccat gtaaataggc 4200
acttaaatag aagcaggaaa gggagacaaa gactggcttc tggacttcct ccctgatccc 4260
cacccttact catcacctgc agtggccaga attagggaat cagaatcgaa accagtgtaa 4320
ggcagtgctg gctgccattg cctggtcaca ttgaaattgg tggcttcatt ctagatgtag 4380
cttgtgcaga tgtagcagga aaataggaaa acctaccatc tcagtgagca ccag 4434

<210> 25
<211> 2837
<212> DNA
<213> Unknown

<220>

<223> Description of Unknown Organism: Unknown
<400> 25

agagagccag tccgatgtct gcagcctccc tggccaggcc tctcctctcc tgccgcagct 60 

agtccccctc aggacagaca gagtactggc gtcggtcacc attcacttgc aaacacacca 120 

ggtcacgtga agaaacttcc tggtgacact caggctgtag ctgtgcactc ttcaaccacg 180 

aggttggttt tctcctaagt gtcacaggtg gagacaagat gctctgggca ctggccctgc 240 

tggctctggg catagggcca agagcttctg ctggtgacca cgtcaaggac acttcatttg 300 

accttttcag catcagcaac attaaccgga agaccatcgg tgccaagcag ttccgagggc 360 

ctgaccccgg ggtgcccgcc taccgttttg tacggtttga ctacatcccc ccagtgaaca 420

cagatgatct caacaggatt gtcaagcttg caaggagaaa ggagggcttc ttcctcacag 480 

cccaactgaa gcaggaccgc aagtctcggg gaacgctcct ggtgttggaa ggccccggca 540 

cctcccagag gcagtttgag attgtgtcca atggcccagg ggacactttg gacctcaact 600 

actgggtaga aggcaatcag cataccaact tcctggagga tgtgggccrg gctgactccc 660 

agtggaagaa tgtgactgtg caggtggcca gtgacaccta tagcctgtat gtgggctgcg 720 

atcttatcga cagtgtcacc ctggaagaac cattctatga gcagctagaa gtagacagga 780 

gcaggatgta cgtggccaaa ggtgcatctc gagagagtca cttcaggggc ttgctgcaga 840 

atgtccatct cgtgtttgca gattctgtgg aagatatctt aagcaagaaa agctgtcaac 900 

acagccaggg agctgaagtc aacaccaicca gtgaacatac agagactctc catctgagcc 960 
ctcacatcac cacagatctc gtggtccagg gtgtggagaa ggcacaggag gtgtgtacgc 1020 
actcctgcga ggagttgagc aacatgatga atgagctctc tggactgcac gtcatggtga 1080 
accagctgag caagaacctg gagagagtgt ctagtgataa ccagttcctt ttggagctca 1140 



ttgggggccc tctgaagaca agaaacatgt cagcctgtgt gcaggagggc cgaatctttg 1200
cagaaaatga aacctgggtt gtggatagtt gtaccacatg cacctgcaag aaatttaaaa 1260
cagtctgcca tcagatcacc tgctcacctg caacttgtgc caacccatct tttgtggaag 1320
gcgagtgctg tccatcctgt tcacactctg cagacagtga tgagggctgg tctccgtggg 1380
cagagtggac cgagtgttct gtcacctgtg gctctgggac ccagcagaga ggccggtctt 1440
gtgatgtcac cagcaacacc tgcctgggcc cctccattca gacaaggaca tgcagcctgg 1500
gcaaatgtga tacgagaatc cgtcagaatg gaggctggag tcactggtca ccctggtctt 1560
catgctccgt gacttgtgga gttggcaatg tcacccgcat acgtctctgc aactcaccag 1620
tgccccagat gggtggcaag aactgcaagg gcagcggccg ggaaaccaaa ccctgtcagc 1680
gtgatccgtg cccaattgat ggccgctgga gcccctggtc cccttggtca gcctgcacag 1740
ttacctgtgc tggagggatc cgtgagcgct cacgtgtttg caacagccct gagccccagt 1800
atggagggaa ggactgtgtc ggggatgtga cagaacacca aatgtgcaac aagagaagct 1860
gccctattga tgggtgctta tccaacccgt gttttcctgg agccaagtgc aacagcttcc 1920
ctgatgggtc ctggtcctgt ggctcctgcc cagtgggctt tctgggcaat ggtacccact 1980
gtgaggacct ggatgagtgt gctgtggtca cagatatttg cttctcaact aacaaagctc 2040
cccgctgtgt caacaccaac ccgggcttcc actgcctgcc ttgtccacca cgctacaagg 2100
ggaaccaacc cttcggtgtt ggcctggagg atgctaggac agaaaaacaa gtgtgtgagc 2160
cagagaatcc atgtaaggac aagactcaca gctgccacaa gaatgcagag tgcatctacc 2220
tgggccactt tagtgacccc atgtacaagt gtgagtgcca gattggctac gcaggtgatg 2280
ggctcatctg cggggaggac tcagacctgg atggctggcc caacaacaac ctggtgtgtg 2340
ctactaatgc cacctaccac tgcatcaagg acaactgccc caaactgcca aattccgggc 2400




aggaggattt tgataaggat ggaatcggag atgcttgtga cgaggacgat gacaatgacg 2460
gtgtgagcga tgagaaggac aattgccagc ttctcttcaa tccccgtcaa ttagactatg 2520
acaaggatga ggttggagac cgctgtgaca actgccccta tgtgcacaac ccagcacaga 2580
tcgacacaga caacaatggc gagggggatg cctgctctgt ggacattgac ggagacgatg 2640
ttttcaatga gcgagacaat tgtccatatg tctacaacac tgaccagaga gatacggatg 2700
gtgatggcgt gggtgaccac tgtgacaatt gtcctctgat gcacaaccca gatcagatcg 2760
atcaggacaa tgatctcgtt ggagaccagt gtgacaacaa tgaggacata gatgatgacg 2820 
gccaccagaa caaccaa 2837 

<210> 26 
<211> 4108 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 26 

agagagccag tccgatgtct gcagcctccc tggccaggcc tctcctctcc tgccgcagct 60 

agtccccctc aggacagaca gagtactggc gtcggtcacc attcacttgc aaacacacca 120 

ggtcacgtga agaaacttcc tggtgacact caggctgtag ctgtgcactc ttcaaccacg 180 

aggttggttt tctcctaagt gtcacaggtg gagacaagat gctctgggca ctggccctgc 240 

tggctctggg catagggcca agagcttctg ctggtgacca cgtcaaggac acttcatttg 300 

accttttcag catcagcaac attaaccgga agaccatcgg tgccaagcag ttccgagggc 360 

ctgaccccgg ggtgcccgcc taccgttttg tacggtttga ctacatcccc ccagtgaaca 420 




cagatgatct caacaggatt gtcaagcttg caaggagaaa ggagggcttc ttcctcacag 480
cccaactgaa gcaggaccgc aagtctcggg gaacgctcct ggtgttggaa ggccccggca 540 
cctcccagag gcagtttgag attgtgtcca atggcccagg ggacactttg gacctcaact 600 
actgggtaga aggcaatcag cataccaact tcctggagga tgtgggcctg gctgactccc 660 
agtggaagaa tgtgactgtg caggtggcca gtgacaccta tagcctgtat gtgggctgcg 720 
atcttatcga cagtgtcacc ctggaagaac cattctatga gcagctagaa gtagacagga 780 
gcaggatgta cgtggccaaa ggtgcatctc gagagagtca cttcaggggc ttgctgcaga 840 
atgtccatct cgtgtttgca gattctgtgg aagatatctt aagcaagaaa agctgtcaac 900 
acagccaggg agctgaagtc aacaccatca gtgaacatac agagactctc catctgagcc 960 
ctcacatcac cacagatctc gtggtccagg gtgtggagaa ggcacaggag gtgtgtacgc 1020 
actcctgcga ggagttgagc aacatgatga atgagctctc tggactgcac gtcatggtga 1080 
accagctgag caagaacctg gagagagtgt ctagtgataa ccagttcctt ttggagctca 1140
ttgggggccc tctgaagaca agaaacatgt cagcctgtgt gcaggagggc cgaatctttg 1200 
cagaaaatga aacctgggtt gtggatagtt gtaccacatg cacctgcaag aaatttaaaa 1260 
cagtctgcca tcagatcacc tgctcacctg caacttgtgc caacccatct tttgtggaag 1320 
gcgagtgctg tccatcctgt tcacactctg cagacagtga tgagggctgg tctccgtggg 1380 
cagagtggac cgagtgttct gtcacctgtg gctctgggac ccagcagaga ggccggtctt 1440
gtgatgtcac cagcaacacc tgcctgggcc cctccattca gacaaggaca tgcagccrgg 1500 
gcaaatgtga tacgagaatc cgtcagaatg gaggctggag tcactggtca ccctggtctt 1560 
catgctccgt gacttgtgga gttggcaatg tcacccgcat acgtctctgc aactcaccag 1620 
tgccccagat gggtggcaag aactgcaagg gcagcggccg ggaaaccaaa ccctgtcagc 1680 



gtgatccgtg cccaattgat ggccgctgga gccccrggtc cccttggtca gcctgcacag 1740
ttacctgtgc tggagggatc cgtgagcgct cacgtgtttg caacagccct gagccccagt 1800
atggagggaa ggactgtgtc ggggatgtga cagaacacca aatgtgcaac aagagaagct 1860
gccctattga tgggtgctta tccaacccgt gttttcctgg agccaagtgc aacagcttcc 1920
ctgatgggtc ctggtcctgt ggctcctgcc cagtgggctt tctgggcaat ggtacccact 1980
gtgaggacct ggatgagtgt gctgtggtca cagatatttg cttctcaact aacaaagctc 2040
cccgctgtgt caacaccaac ccgggcttcc actgcctgcc ttgtccacca cgctacaagg 2100
ggaaccaacc cttcggtgtt ggcctggagg atgctaggac agaaaaacaa gtgtgtgagc 2160
cagagaatcc atgtaaggac aagactcaca gctgccacaa gaatgcagag tgcatctacc 2220
tgggccactt tagtgacccc atgtacaagt gtgagtgcca gattggctac gcaggtgatg 2280
ggctcatctg cggggaggac tcagacctgg atggctggcc caacaacaac ctggtgtgtg 2340
ctactaatgc cacctaccac tgcatcaagg acaactgccc caaactgcca aattccgggc 2400
aggaggattt tgataaggat ggaatcggag atgcttgtga cgaggacgat gacaatgacg 2460
gtgtgagcga tgagaaggac aattgccagc ttctcttcaa tccccgrcaa ttagactatg 2520
acaaggatga ggttggagac cgctgtgaca actgccccta tgtgcacaac ccagcacaga 2580
tcgacacaga caacaatggc gagggggatg cctgctctgt ggacattgac ggagacgatg 2640
ttttcaatga gcgagacaat tgtccatatg tctacaacac tgaccagaga gatacggatg 2700
gtgatggcgt gggtgaccac tgtgacaatt gtcctctgat gcacaaccca gatcagatcg 2760
atcaggacaa tgatctcgtt ggagaccagt gtgacaacaa tgaggacata gatgatgacg 2820
gccaccagaa caaccaagac aactgcccat acatctccaa ctccaaccag gctgaccatg 2880




acaacgacgg caagggcgat gcctgcgact ctgatgatga caatgatggt grtccagatg 2940
acagggacaa ctgtcggctt gtgttcaacc cagaccagga agactcggac ggtgacggcc 3000 
gaggtgacat ttgtaaagat gactttgaca atgataatgt cccagatatt gatgatgtgt 3060 
gccctgagaa caatgccatc actgagacag acttcagaaa cttccagatg gtccctctgg 3120 
atcccaaggg gaccacacaa attgatccca actgggtaat tcgtcaccaa ggcaaagagc 3180 
tggtgcagac agcaaactca gaccctggca tcgctgtagg tttcgacgag tttgggtctg 3240 
tggacttcag tggcactttc tatgtcaaca ctgaccggga tgatgactac gctggctttg 3300 
tctttggcta tcagtcaagc agccgcttct atgtggtgat gtggaagcag gtgacccaga 3360 
cctactggga agacaagccc agtcgggctt acggctactc tggtgtgtca ctcaaagtgg 3420 
taaactccac gactggtact ggcgagcacc tgaggaatgc cctgtggcac acgggaaaca 3480
cagaaggcca ggtccggact ctatggcatg accccaaaaa cattggctgg aaagactaca 3540
ctgcctacag gtggcacctg attcacaggc ctaagacagg ctacatgaga gtcttagtgc 3600 
atgaaggaaa gcaagtcatg gctgactcag gaccaattta tgaccaaacc tacgctggtg 3660 
gacggctggg cctgtttgtc ttctcccaag agatggtcta tttctcggac ctcaagtatg 3720 
agtgcagaga tgcctagaga gcagggctcc agctccagca atgtgctgca aacacccctt 3780 
cttagacaca tcagtccatc ttggcacttg tggcttttct gtcatttggc atttcctgtt 3840 
tcttgacctt aactgagtgg atctacacct ccttcatcag caccaagtcc aagtgtcttc 3900 
aaaggagaaa catcaattgc actccaagag cttccagcct gctgctggaa aacatttgga 3960 
tgagatatga ggctcaccgt ggagcgaaga ccgagcattc cgctgtgttg ccttttcttg 4020 
tttgtttaaa aagaatgacg tttacatgta aatgtaatta cttgcagtat ttatgtgtat 4080 
atggagtcga agggagcttt agagcaca 4108 






<210> 27 
<211> 820 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 27 

tcgaccagag gaggggaggc cagttcctct cccaagggtg ccacacaccc ctccctgttc 60 
atcaccagac aggcccttcc ttcttagcca tatgctaacc ttctcctccc tgggaaattt 120 
cctctgcagg agccaaagca gatgggagct ggagttgctg gagctcctgg tctgtatgca 180 
gagcaggcat ccaggaaagg agaagagagt gtgacaatcc agcacctcag aatggagggg 240 
cctcgtgttc agggcggaaa gtacagacgc aggcttgctg agggcctctg gacacaggct 300 
ggaccagatg ctgtggatgt cgacccctgc actgactatt ggataaagac ttctttcaac 360 
taagagaaga tgcaaatcag cacacttttt tctttgttct gccagcttcc aggcctaaga 420 
ctaggttttg ctgtctacag ccaactattc tattagttac aaaactcaat cattttattc 480 
agcaactgga tgttgactgt taactagaag ctctgtccta cttacagcac tttggatcat 540 
caaaaaaata aagtaaaata gaaaactgag aaaactcaat ccatgaccag ggagaactta 600 
caggatgtta gagacaaaac aagcagacac ctgaaacaat caacgcccaa taaaacaaag 660 
taggatgaaa attctcttag ttctttgata acaatrtgtt cactcataga aacattatta 720 
attggtaggg taagcagaca ctctgaaaca atgagaaaaa tactaaaaat tgacttgagt 780 
tatttcaaat tgcctcattg acctgttata tcataactct 820 

<210> 28 




<211> 2397 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 28 

tttttttttt catcctactt tgttttattg ggcgttgatt gttacaggtc ccagcctgta 60 

gacatctttt actccaattt cctgaataga tagctttatt ccttcaaggt aatatagtgc 120 

ggtggcttct ggctgagatg tttgctgttg ttttcttcat cttgtctttg atgacttgtc 180 

agcctggggt aactgcacag gagaaggtga accagagagt aagacgggca gctacacccg 240 

cagcagttac ctgccagctg agcaactggt cagagtggac agattgcttt ccgtgccagg 300 

acaaaaagta ccgacaccgg agcctcttgc agccaaacaa gtttggggga accatctgca 360 

gtggtgacat ctgggatcaa gccagctgct ccagttctac aacttgtgta aggcaagcac 420 

agtgtggaca ggatttccag tgtaaggaga caggtcgctg cctgaaacgc caccttgtgt 480

gtaatggaga ccaggactgc cttgatggct ctgatgagga cgactgtgaa gatgtcaggg 540 

ccattgacga agactgcagc cagtatgaac caattccagg atcacagaag gcagccttgg 600 

ggtacaatat cctgacccag gaagatgctc agagtgtgta cgatgccagt tattatgggg 660 

gccagtgtga gacggtatac aatggggaat ggagggagct tcgatatgac tccacctgtg 720 

aacgtctcta ctatggagat gatgagaaat actttcggaa accctacaac tttctgaagt 780 

accactttga agccctggca gatactggaa tctcctcaga gttttatgat aatgcaaatg 840 

accttctttc caaagttaaa aaagacaagt ctgactcatt tggagtgacc atcggcatag 900 

gcccagccgg cagcccttta ttggtgggtg taggtgtatc ccactcacaa gacacttcat 960 
tcttgaacga attaaacaag tataatgaga agaaattcat tttcacaaga atcttcacaa 1020 






aggtgcagac tgcacatttt aagatgagga aggatgacat tatgctggat gaaggaatgc 1080 
tgcagtcatt aatggagctt ccagatcagt acaattatgg catgtatgcc aagttcatca 1140
atgactatgg cacccattac atcacatctg gatccatggg tggcatttat gaatatatcc 1200 
tggtgattga caaagcaaaa atggaatccc ttggtattac cagcagagat atcacgacat 1260
gttttggagg ctccttgggc attcaatatg aagacaaaat aaatgttggt ggaggttrat 1320 
caggagacca ttgtaaaaaa tttggaggtg gcaaaactga aagggccagg aaggccatgg 1380 
ctgtggaaga cattatttct cgggtgcgag gtggcagttc tggctggagc ggtggcttgg 1440
cacagaacag gagcaccatt acataccgtt cctgggggag gtcattaaag tataatcctg 1500 
ttgttatcga ttttgagatg cagcctatcc acgaggtgct gcggcacaca agcctggggc 1560 
ctctggaggc caagcgccag aacctgcgcc gcgccttgga ccagtatctg atggaattca 1620 
atgcctgccg atgtgggcct tgcttcaaca atggggtgcc catcctcgag ggcaccagct 1680 
gcaggtgcca gtgccgcctg ggtagcttgg gtgctgcctg tgagcaaaca cagacagaag 1740 
gagccaaagc agatgggagc tggagttgct ggagctcctg gtctgtatgc agagcaggca 1800 
tccaggaaag gagaagagag tgtgacaatc cagcacctca gaatggaggg gccbcgtgtc 1860
cagggcggaa agtacagacg caggcttgct gagggcctct ggacacaggc tggaccagat 1920 
gctgtggatg tcgacccctg cactgactat tggataaaga cttctttcaa ctaagagaag 1980 
atgcaaatca gcacactttt ttctttgttc tgccagcttc caggcctaag actaggtttt 2040 
gctgtctaca gccaactatt ctattagtta caaaactcaa tcattttatt cagcaactgg 2100 
atgttgactg ttaactagaa gctctgtcct acttacagca ctttggatca tcaaaaaaat 2160 
aaagtaaaat agaaaactga gaaaactcaa tccatgacca gggagaactt acaggatgtt 2220 




agagacaaaa caagcagaca cctgaaacaa tcaacgccca ataaaacaaa gtaggatgaa 2280 

aattctctta gttctttgat aacaatttgt tcactcatag aaacattatt aattggtagg 2340 

gtaagcagac actctgaaac aatgagaaaa atactaaaaa ttgacttgag ttatttc 2397 

<210> 29 
<211> 4100 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 29 

ggatcccccc gctccgctac catcttcatc gacctcaccc aggacgacga ctgagctccc 60 

tcttcctcgc cgcggactgg ggcgaccctg ttgctgctgc ggccgccgcc gctcctgccc 120 

ccacttcggc tcccgctcct gctcctgctc ccggccccac tcctgttcct gttcctgttc 180 

ctgttcctgt tcccggtcct gctccggctc ccggccccgc acccacctcc gcrcctgctg 240 

cgggtctcca ggcccagaca aaataaaaaa agatatattt tttcagtccg tctctcccgc 300 

ccggtgtctt ctatggctga gggagtctgg ctctcggggc tctcgggtcg gctgggcggc 360 

tcggctggtt ggctggctgg cgagatggac cgctccggcg cgcagcgtcc gcggctgctg 420 

tgatgggtgg gcggagcgcg gaccggggat tatatacacg atgtgcatcc ataattgatg 480 

ttgtttgaga aaaacaaagt cataaagtgg cactcagaca gcactttggc ctggcgcccg 540 

gccaccatct gagtgcccaa ccgggcccgg cggttacatc acccccacat ggaccatcac 600 
ggcccattag caccaattgg ccagagtgtc gggagccacc gctaattgca gtaacgcgcg 660 
gctgccagac tgcaatttac cgcgcgatac tgcagtttac tgcagccgcg gtaaactgca 720 
gtacgcggcg gccgcaggaa atctactgta gtatttggcg gcggagcgcg gtactgcaac 780 






tgtagtaaac tgtagctgca gtagagttac tgcagcgcca tcgggccggt gtggccgcca 840 
gggtaactgc acccgcagta aatttactgc agccggactt tgtgcgctgt ggagaccgcg 900 
ccgaactggg acccccccga ctcccccccg actccccccc gactcccccc cgactccccc 960 
ccgactcccc cccgactccc ccccgactcc cccccgactc ccccgggacg cgtccgcgcc 1020 
tcgatgcgcc ccatcgcgcc ccgttccgct tcgccacgct ccagttgccc cgcccccggc 1080 
acgtggcacg tatttccccc ccgtaaatca agagggatta tgcggatgtc tagtttatgt 1140 
ctcaatttcc tctttccgga gataaaagcc gggacccccg cgccgaaaaa ggatacacca 1200 
gccgcgatgt cgccgctcgt ggcggtgctg gtgttttttt cggcggcgct gggggttcct 1260 
ggccccggcg tcgcgggaaa cccccgtggg ctcgatgcca tcttcgaggc cccggtcacg 1320 
cccgcgcccc ccactcgcca tcctcggcgc gaggagctgg agtgggacga tgaggatcac 1380 
ccgctgctgg acctcgagcc gcccgtggga tcacgctgcc atccctacat cgcgtactcg 1440 
ctgccgccgg acatgaacgc cgtcacgagc gtggtcgtga agccctactg ctcgccgccg 1500 
gaggtcatcc tgtgggcgtc tggcaccgcc tacctggtca acccctttgt cgccatccag 1560 
gccctggccg tcggagagcc cttaaatgag gcggccctca aggagctcgg agaggtggcc 1620 
gtgcacaagg actccctgcc gccgctgcgc tataatggag ggccccccgc cgagtaagag 1680 
accctgcggc ctgccgcccg gggtgcgcct cgtcgtgcct gccgccgccg ccgcttctgc 1740
ctctaacgcc gccaccgccg ctgcagcagc agcccccgcc ggggccgggg ccggggcctc 1800 
gaagccggcc cgaccccccg ccgccgcccg gcccgcgaag ggcacgcccg cggcgtcggc 1860
ggcaacaaca gccacggggg ccgacgcctc cgccccggcc cccgaccccg gggcgcccac 1920 
gtgggacgcc ttcgccgccg agttcgacgt ggccccctcg tggcgcgcgc tgctggagcc 1980 
cgagatcgcc aagccgtacg cgcgcctgct gctggccgag taccgcggcc gctgcctgac 2040
cgaggaggtg ctgcccgcgc gcgaggacgt gttcgcctgg acgcgcctca cggcgcccga 2100 
ggacgtcaag gtggtcatca tcggccagga cccgtaccac gggccgggcc aggcccacgg 2160 
gctggccttc agcgtccggc gcggggtgcc gatccccccg agcctggcca acatcttcgc 2220 
ggcggtccgg gcgacgtacc cgacgctgcc cgcgcccgcc cacggctgcc tggaggcctg 2280 
ggcgcgccgc ggggtgctgc tgctgaacac gacgctgacc gtgcggcgcg gggtccccgg 2340
ctcccacgcc ccgctcggct gggcgcggct cgtgcgcgcc gtcgtccagc ggctctgcga 2400
gacccgcccc aagctggtgt tcatgctctg gggcgcccac gctcaaaagg cctgcgcgcc 2460
ggacccgcgc cgccacaagg tgctcacctt cagccatccg tcgccgctgg cccgcacgcc 2520 
cttcaggacc tgcccgcact ttggagaggc gaacgcgtac ctcgtccaga cgggccgggc 2580 
ccccgtcgac tggagcgtgg actgagtcgg gcgtgcgcgc acaccgccgg cggaggacga 2640
ggagggggga ggggggtggg atggacggag gagagcggat gatggagccc gcgctcgccg 2700
gcgccccggc cagcgcgctg ccggtcctgg cggtgctgcg cgagtgggga tgggccgtgg 2760
aggaggtcga gccctccggg ccgtgcccgg aggacgcgga cgcgccccgg gagagcgcac 2820 
cccctccccg ggagggggtg cgcgggagcg aagacggaga ggggggcgtg gaagacggcg 2880
aggaggggaa ggcgacggag aaggaggaga cggaagacga ggaagacggg ggggacgaag 2940
ggacgacgac ggcggcggcg ggcccgcgcc gggcgcagca cgtggagttt gacacgctgt 3000 
ttatggtcgc gtccgtggac gagctcgggc gccggcggct gacggacacg atccgccggg 3060 
acctggccgc ggccctggcc ggcctccccg tcgcctgcac caagacgtcc gcgtttgcgc 3120 
gcggcgcgcg cggcccgcgc ggcgcccccg ggcgcggcca taaaagcctg cagatgttta 3180 
tcctgtgccg cagagcccac gcggcgcgcg tacgcgatca gctccggtcc gcggtgcgcg 3240 
cccgacgccc acgcgagccc cgcgcgcgcc cgacgagcgg acgggcgcgg ccggccgcgc 3300
cggtgttcat ccacgagttc atcacccccg agccggtgcg gctgcaccgg gacaacgtgt 3360 
ttgcggcgcc atgagcacct tcggacgcgc gtccgtggcc acggtcgatg actaccaccg 3420 
gttcctgcag gccaacgaga cggccgcccg gcgcctggcc gcggcctccc gccgcgtctc 3480 
caccggcggg ggcgagacgc gggccccgcg gtcctcgcgc ggcccccacg acgatgaggc 3540 
gcccctgcgc gccggcggcc tgggcaccgc ccgcgggcgc tcgcgccagc gcggcgcgac 3600 
cgagccggac cccgtctacg ccaccgtcgt ccagcctacc caccaccacc accagcagca 3660 
ccaccaccgc tctcagcatc cgcagcagca gcaacaacag cagcgggccc cacgccgccg 3720 
cggcagcgtg cacgcctcgg cgacggccgc ggacggaccc gagtcgtgcg cggccgcacc 3780 
cccgcgccgc cgcggcagcg tgcacgcctc ggcgacggcc gccccggcgg tccagctgcc 3840 
ccggccccgg caacggagca tcaacgcctc gacgacggcc gccccgacgc cccagctgcc 3900 
gagaccccgc cagcgcagcg tcaacgcctc ggcccgcgcc gccgtcccct cgacggccac 3960 
cctcccgcgc ccccggaccc cgtcccgggg ccggcgcgcg ccccccgcct catgctgtta 4020 
tcgcgatcaa taaagggcga gcgcccacgg accagacaaa agacacaacc ggttcggtct 4080 
ctctgtccgc gcacgcgcgg 4100 



<210> 30 
<211> 38734 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 30 
gatcctcgtg accgggtaca ccgacgcctc ctggacgccg ctgttcgcca tcgcgggcgg 60
ggtcgtcacc gacatcgggt cgatgctctc gcacagttcc atcgtggccc gcgagttcca 120 
cgtcccgtcg gtggtgaaca ccaaggacgc cacccagcgc atcaacaccg gcgacctgat 180 
cgtggtggac ggcgacgcgg gcacggtcga ggtcgtcgag agcgcggaca ccgacccgca 240
gggcccggcc ggggccgccg ggaccccggc cggagccacc accgactgaa gccggccacc 300 
gccgcaacac cggaccacga ccgcccccgc gaggggcgga ccacacccca gacgggagac 360 
gacccgatga tccccaacca gtggtatccc atcgtcgagg cgcaggaggt gggcaacgac 420 
aaaccgctcg gtgtgcgccg catgggccag gacctcgtgc tctggcgcga catcgacggc 480
aacctcgtct gccagggcgc ccgctgcccg cacaagggcg ccaacctcgg cgacggccgc 540
atgaagggca acaccatcga atgcccgtac cacggcttcc gctacggagc cgacggtgcc 600 
tgccgggtga tcccggcgat gggctccgag gcccgcatcc ccggctcgct gcgggtaccc 660 
acctacccgg tccgggagca gttcggcctg gtgtggatgt ggtggggcga cgagcgcccg 720 
acggccgacc tgccgccggt ggcggccccg gccgaggtga cggacaaccg gaagctgtac 780 
gccaccaagc gctggacccg cccggtgcac tacacccgtt acatcgagag cctgctcgag 840 
ttctaccacg tgacctacgt gcaccgggac cactggttca actacatcga ctacctgctc 900 
ctgtacggca ccccgagcaa gttcggcctc gacggccgcg agcggtacct ggccgccacc 960 
cggatcacca accaccgggt ggagacggag gcggaggggc agaccatccg ctactccttc 1020 
gaccactgcc aggaggacga ccccaccaac accacccact acgtcatcac gttcaccttc 1080 
ccgtgcatgg tgcacgtgca gaccgagcag ttcgagacca cctcctggct ggtgcccatc 1140
gacgaccaga acaccgagca catcctgcgc tggtacgagt acgaacaggt caagcccgtc 1200 
ctgaggttcg aaccgctgcg ccgtctgctg ccctgggcgt ccctctacat ggagaagtgg 1260 
gtgcaggacc cccaggacgt ccgcatcatg gaacaccagg aacccaagat cagcgccggc 1320
ggcgtgaaca agttcatccc cgtcgacgag atgaacgcca agtacatctc gatgcgcgcc 1380 
aagctgatcg cggacgcctc ggccgcgccc tcgtcaccgg cgcgggcggc ggagcccgag 1440
ccggaagcgg cggggcgggg cggatcagcg gcccgtgcca cgggcaacgg caggggagcg 1500 
gccggcggac gacgcggcac caagcccaag gaggacgccg ccgcgcgccc gtagacccga 1560 
agacggggga cggacaagag agagcgagag tgagagatgt acggcggata cgacgcgtcg 1620
accggcccca aggccctggt gacggccttc aacaccgtcg ccgtggccgg cgccgtgtgg 1680 
ttcctgttcg gcggcgcgga caccgtggcc gactggttcg gcaccgactt cgacgaggcg 1740 
gtgaccctgc gccgggtcct gctggcgacc ctgtcggtgc tctacctgct gcgcttcatc 1800 
gccacgaact tcgtgatgct ccagcgcaag atggagtggt cggagtcggc caccatcggg 1860
atctgggtcc tggtgatcca cggcacgatg gcgtacttcg gcggcaccaa cgacgccggc 1920 
gtgagcgcgt tcacctggct gggcgtcgtg ctgtacctcc tcgggtccta cctgaacacg 1980 
gggtcggagt accagcgcaa actctggaag aagcgcccgg agaacaaggg caagctctac 2040 
accgaaggcc tgttcaagca ctcgatgcac atcaactact rcggtgacgc cgtgctcttc 2100 
tccgggttcg cgctggtcac gggcaccccg tgggccttcg ccatccccct gatcatggtc 2160 
tgcatgttcg tcttcctgaa catccccatg ctcgacaagt acctcgccga gcgatacggc 2220 
gaggccttcg acgagtacgc gtcccggacg gcgaagttcg tcccctacgt gtacrgaccc 2280 
cgcccgtcac gcgcgtacgg cggcctcccc gggcgagggg ggccgccgta ccgggtggca 2340
accacagatc ccacagatcc ccacagatcc ccacagagcc cctccacaga ccccctccag 2400 
agatccacag atcccctcca cagatccgag acgaggcacg tatgaccgga gacattccct 2460
tcggagaggc cgaggcgtcc ctgaccgccg aggtgctgcg cgaggtcctg gccggcggcg 2520
ccgaggcgtt cgcccggctg acctccgacg agggcgccgt cgacgacttc ggcttcgacc 2580 
cggagctgac cgacgactac ctgctccccg ccctgcgcct gctgtacgag aagtacttcc 2640 
gggtcgacct ggagggactg gagaacgtgc cggccgaggg gggcgcactc ctggrcgcca 2700 
accactccgg caccctgccg ctcgacgccc tgatgctcca ggtggcgctg cacgaccatc 2760
acagcacgca ccgcaggctc cggctgctcg ccgccgacct tgccttcgac ctccccgtcg 2820 
tccgtgacct cgcccgcaag gccggccacg tacgcgcctg ccccgagaac gcgctgcggt 2880 
tgctcggctc cggcgaactg gtcggcgtga tgccggaggg ctacaagggg ctcggcaagc 2940
ccttcgagga gcgctaccgg ctgcagcgct tcggccgggg aggcttcgcg gcggtggcac 3000 
tgcggtcgcg gcgccccatg gtgccgtgct cgatcgtcgg cgccgaggag atctacccga 3060 
tgatcggctc ggcccccacc ctggcccgga tgctgaagct gccgtacttc ccgatcaccc 3120 
cgaccttccc gctgctgggc gcgctgggcc tgatcccgat gccgaccaag tggaccatcc 3180 
gcttcggtgc cccgatccac acggacggct tccccgagga cgccgcggag gacccgctgg 3240 
tggtcgagaa gctcgccggc gaggtgaagg acaccatcca gcacacgctc aacgagatgc 3300 
tggagggccg cggctccccg ttcgtctgag ggccgcggct cccggttcgc ccgagggcgg 3360 
cggctcccgg ttcgcccgag gaccgtccct ctcgtccggg gccccgcctc agccccccgc 3420 
cgacgatccc cggcggcaga tgctgcgaac gctggcgaag gccagaacgg cgaggccgac 3480 
gagcgtgacg ccgccgccga ccagctccgc ggacagatgc atgggatctc cctcaggggg 3540
acgacggacg gtgatggtca tatagccatg cgaaccccgc cgtccgcccg atccgcagcc 3600 
gcaccgcccc gcgaattcac ccgtagagca gaccggtgcg gccgaggagg ggtggcgatt 3660 
gggtggtcgc gcgttcgaac gcttacgatc ctctgttgtg tccaaactga ccgacgtgcc 3720 
caagcggatc ctcatcgggc gcgcactgcg cagcgaccgg ctgggtgaaa cgctcctgcc 3780
gaagcgcatc gcgcttcccg tgttcgcgtc cgacccgctg tcctccgtgg cgtacgcgcc 3840 
cggcgaggtg ctgctcgtcc tgtccatcgc gggcgtgtcg gcctaccact tcagcccgtg 3900 
gatcgcggtc gcggtcgtgg tcctgatgtt caccgtggtc gcctcctacc ggcagaacgt 3960 
gcacgcctac ccgagcggcg gcggcgacta cgaggtggcc accaccaacc tcgggcccaa 4020 
ggccggtctg accgtcgcca gcgccctgct ggtcgactac gtcctgaccg tcgcggtctc 4080 
catctcctcc ggcatcgaga acctgggctc cgcgatcccc ttcgtcgtcg agcacaaggt 4140
cctgtgcgcg gtcgccgtga tcctgctgct cacgctgatg aacctgcgcg gggtcaggga 4200 
gtcgggcacc ctgttcgcga ttccgacgta cgtcttcgtc gcgggcgtct tcatcatgat 4260
cgtgtggggg gcgttccgcg gactggtcct ggacgacacc atgcgtgccc cgaccgcgga 4320 
ctacgagatc aagccggagc acggcggcct ggccggcttc gcgctgatct tcctcctcct 4380 
gcgcgccttc tcctccggct gtgccgcgct caccggtgtc gaggcgatct ccaacggcgt 4440
cccggccttc cgcaagccca agtccaagaa cgcggggaac accctcgcga tgatgggtct 4500 
gctggccgtc accatgttct gcggcatcat cgcgctggcc gccgcgaccg acgtgcggat 4560 
gtcggagaac ccggccaccg acctcttcca caacggcgtc gcggtcggcg cggactacgt 4620
ccagcacccg gtgatctcgc aggtcgccga ggcggtcttc ggcgagggca gcttcctgtt 4680
catcgtgctg gccgcagcca ccgcgctggt cctcttcctc gccgccaaca ccgcgracaa 4740
cggcttcccg ctgctcggct cgatcctcgc ccaggaccgc tacctgccgc gccagctgca 4800 
cacccgcggc gaccgcctgg ccttctccaa cggcatcgtg ctcctcgccg gagccgccat 4860 
gctcctggtc gtcgtctacg gcgccgactc gacccggctg atccagctct acatcgtcgg 4920
cgtcttcgtg tccttcacgc tcagccagat cggcatggtc cgccactgga accgcaacct 4980
ggccggcgag cgggaccagt ccaagcgacg ccacatgatg cgctcccgcg cgatcaacgc 5040 
cttcggcgcc ttcttcaccg gcctcgtcct ggtggtggtc ctggcgacca agttcacgca 5100 
cggcgcctgg gtcgcgctgc tcggcatgtg catcttcttc gcgaccatga cggcgatccg 5160 
caagcactac gaccgggtcg ccgaggagat cgcggccccg gaggaccccg aggaggcaca 5220 
gagcgacgac atggtgcgcc cctcacgcgt tcactcggtg gtcctgatct ccaagatcca 5280 
ccgccccacg ctccgcgccc tcgcctacgc caagctgatg cgctccgaca gcctggaggc 5340
gctcagcgtc aacgtcgacc cggccgagac gaaggcgctg cgcgaggagr gggagcgccg 5400 
cggcatcgcc gtaccgctga aggtcctgga ctcgccgtac cgcgagatca cccggccggt 5460 
catcgagtac gtcaagagcc tgcgcaagga gtccccgcgc gacgcggtct cggtgatcat 5520 
ccccgagtac gtggtcggcc actggtacga gcacctgctg cacaaccaga gcgccctgcg 5580 
cctcaagggc cggctgctgt tcacgccggg cgtcatggtc acgtcggtcc cgtaccagct 5640 
ggagtcctcc gaggccgcca ggcgccgggc gcgcaagcgc caggactgga gcgcgccggg 5700 
tgcggtgcgg cgcggaccgg cccaccacca ccaggaccgt gaccgtacga aggactcctc 5760
ctcgtccacg tagactggac ggctgttgtc cctgtcatcc ccccgttctc tggagtcacc 5820 
ccgccatgca ggcagaaccg aagaagtcgc aggcggaaca gcgagcggtc gcggagccgg 5880 
tctcggagcc ggtctcgctg gtgggcgagg agtacgaggt cgaggtcggc cccgtcgccc 5940 
acggcggcca ctgcatcgcc cgcacgtccg agggccaggt gctgttcgtc cggcacacgc 6000 
tgcccggcga gcgggtcgtg gcccgggtga cggagggcga ggagggtgcc cgcttcctgc 6060 
gggcggacgc ggtcgagatc ctggacccct ccaaggaccg catcgaagcc ccctgcccct 6120 
tcgccggccc cggccgctgc ggcggctgcg actggcagca cgccaagccg ggcgcccagc 6180 
gacgcctgaa gggcgaggtg gtcgccgagc agttgcagcg cctggcgggt ctcaccccgg 6240
aggaggccgg ctgggacggc acggtgatgc cggccgaggg cgacaagctg ccggccggcc 6300 
aggtcccgtc gtggcgcacg cgcgtgcagt tcgcggtgga cgccgacggt cgcgccggtc 6360 
tgcgccgcca ccgctcccac gagatcgagc cgatcgacca ctgcatgatc gcggcggagg 6420 
gcgtcagcga actgggcatc gagcgccgtg actggcccgg catggcgacg gtcgaggcga 6480 
tcgcggcgac gggctcccag gaccgccagg tcatcctgac cccgcgcccc ggcgcccgcc 6540 
tccccatcgt cgaactggac cgcccggtct cggtcatgcg cgtcggggag aaggacggcg 6600 
gcgtccaccg cgtccacggc cgccccttcg tccgcgagcg cgccgacgac cgcacctacc 6660 
gcgtcggctc cggcggcttc tggcaggtcc acccgaaggc cgccgacacc ctggtcaccg 6720 
cggtcatgca gggcctgctg ccccgcaagg gcgacatggc cctggacctc tactgcggcg 6780 
tcggcctctt cgccggcgcc ctggccgacc gcgtcgggga ccagggagcg gtcctcggca 6840 
tcgagtccgg caagcgcgcc gtcgaggacg cccgccacaa cctcgccgcc ttcgaccgcg 6900 
tccgcatcga gcagggcaag gtcgagtccg tcctgccccg caccggcatc gacgaggtcg 6960 
acctcatcgt cctcgacccg ccccgcgccg gcgccggccg caagacggtc cagcacctct 7020 
cgaccctggg cgcccgcagg atcgcctacg tggcctgcga cccggccgcg ctggcccggg 7080 
acctggggta cttccaggac ggggggtacc gggtgcggac gctgcgggtg ttcgatctgt 7140 
tcccgatgac tgcgcacgtt gagtgcgtgg cgattttgga gcccgccgca aaggggctct 7200 
gacctgcatt tttcttggct ggatcaggag cggcctgttg cgctcgacct gttctccaaa 7260 
gcgcacgacg tagagcttgc ggaccgctcg tgaaagccgc ctgacctggc gttgcacgag 7320 
cggtgccgcg atgtcggcgt ggtcggccct tctcctggcg cgaagggaaa ccgaaggtct 7380 
tgacgctcgg gtgacgctat ttctgaaggg tcgtcaccga ctggggaggc agggccctgc 7440
ctctcgcgcc cgatgaagca ggttctctct gctccaggta atcgtcgagg gtgccctgac 7500 
ggatcaggta gacggtcagg gaccgcaggg tgcagggcgt cgacgcccag gtcgaggatc 7560 
atcagggcgc tgtcattggt gatggcgaag gcgccgatca caccgtcgac ggaacaggtt 7620 
gcgttcagga gtctgtgcgg gcgccgcacc ggcacggtct ggaagctcgg ctccgaccgc 7680 
agctcgccac cagtccgaga ggagccgata ctgtccggtg ccgggtgacc cttcgtgcaa 7740 
gcgttgctgc ccccgctcgg cagaccgggg cagcaacgct tgcacgatcg gccggtactc 7800 
aacgggatcg tgtggagttt cggaccggaa cggcttggca ggacgtgccc gagcggtacg 7860
gctcctgggc cacattgcac acccgcttcc gtcgatgggt gaaaggcggc acctttcagt 7920 
gaaagggggt tccgcccccc cccgggacct tgcgcccacc gtcgccgacc ggctgatgaa 7980 
ccggctccgc gctcccgcca ccaacctgac ccgacgtgag accgaagtcc tctcaccggt 8040 
cgccgacgga ctgtccgacc aggccabcgg cgcacgcctc cacttgaccg aaggcaccgt 8100 
cgatatcacc tggcctgcat ctatgccaac ctcggaaccg actcgcgcac cgccgctgtg 8160 
gccactgtca ccgccatcga cgacctcggg ctcatccgcc gctgaacagt atgtggtggg 8220 
cggtgtttcc gttctccacg acttcagcgg cgtccggagt tgtggtgctg gctgggcttg 8280 
gtgcccgctc ctctgaaccc atgtgaacgc ccacggccag ttcgagccgg acacgccccc 8340 
cggacctggc ctccgccgcg gccagaatgc ccggcccctg cacctggtct gcctacctgt 8400 
acaggcgagg gcggtccctc ggagccactg cctgtaactc cgaggggccg cccttggccg 8460
actcggcggt caccgcggac gcggtgcggt caggaggcac cgtctgcgtc atcaggcgcg 8520 
gctcggccga gcgagttatt ccggcacccg tgggaccaga aatgtcagcc ctgcgtgacc 8580 
gcttcgaaga ccgtgacgcg gttgtcgtcg gagagcgagt gggtggggtc ggcgtgcgcg 8640 
gcgcggaatg cctccgacga ggtgtaggcg gtgaatgcgg cgtcgtcctc gaagttgagg 8700
acggccaggt agccgtgtgc gcccttgcgg ggacgcagca gccgtgcgtt gcgcagcccc 8760
ggcacgttgg aaagggtggc gcgcatgctg gcggtgcagt tgttctcgaa cgcgccctgg 8820 
gcgggcgcgg cgacggtgaa ctcggtgacg gcggtgctca tttctgtctt ctctcggttg 8880 
ttggtgtgat gtcggtggct gtcccgcccg gccggggccc gcacggcgat ggcgatgatg 8940 
tgcaccgcgt gtccgatcga gttccgttgc ggggtgcggt tacagggctg gagttgggct 9000 
cgggtccgcg gtcggctgag ggagcctgcc tgtgcggcgg gcccagattt cgaacgcgat 9060 
agtcatgaac gggggtacgg cggccagcag ggcgaagatc gttgtccggc ccagccgcca 9120 
cttcagccgg atcgcgacca ggacggtcag ggacacgtag acgatgaagg cggcgccgtg 9180 
gagggtgccg aagatccgta cgccgagttc ggtggtttcg gggatgtact tgaggtacat 9240 
cccggccagc agacctgccc acgtgcacgc ttcgatgatc gcgatccagg tgaacgcgcg 9300 
cagcagacgg ctggtgccgg tttcggcagc ccgtgcggcc ggcgcgtcgg cggaaggcgg 9360 
ttcggaggtg ggcgtggagg gtgtctgcgg ggtgcctggg cgtgcgggcc acagggcgcg 9420 
gttgccgagc aggcggacga tggcggggac caggagcggg cggatgagga aggtgtccag 9480 
caggatgccg caggccatgg cgaagccgaa ctggaacagt tcgcggatcg gctgggtcat 9540 
caggacggcg aaggtcgccg cgaggatgag gcccgcggag gagatgacgc cgccggtgcg 9600 
tgtcagtgcg gcggtgatcg ccttcgctgg gggctgggtg cgcagttcct gcttgaaccg 9660 
gctcatgatg aagatgttgt agtcgacgcc gagcgcgacg aggaagacga agatgtacgc 9720 
ggtgacgcgg ttgccgatgc cgtcgtcacc gaggacggtc acggtgaaga aggtggtggc 9780 
gcccagggtg gccaggaacg acaggagcag ggtcgcgacc aggtagagcg gggcaaggag 9840
cgagcggagc agcaggacga ggaccacggt gacgatggct aggaccagca gcacgatgag 9900
ggtcgtgtcg cggtcgaggg cggagcggat gtcggcgttc tgcgcggtct cgccgccgat 9960
gagcaccgtg gcgtcctgga cgccggcggc ctgggctgcg gattgtgtgg cctgcttgag 10020 
gggaccgatc gcgtcgagtg ccttggagct gtaggggtcg aggtcgagga tgacgtcgta 10080 
gaagacggtc ttgccgtcct tgcccatgcg ggggtctgcg acacggctga cgtgarcggc 10140 
gtcggtgagc gcggtggcga tgtcggcggg tgcggggctg gagcgcaggt tgtcctggga 10200 
atggacgacg acggtactgg gggcgatctc gccgggcccg aattcctccc gaatgaggtg 10260
ctgtccgtgc tccgactcgg tggcggcgcg gaagccgctg agggtgttga agctctcctg 10320 
gtagccgagc agtcccgcgc tcagtaccac caggagtgcg atcacggccg aggccacctt 10380 
gacgggggcc cgtgcgacca gggcggcgat gcggtgccag atgcctgcgc cgcgactgcg 10440
ttcggcggcc ttgtccacgc ccccgggcca gaagacgctc ctgcccagca ggaggaccag 10500 
ggcggggatg aaggtgaacg ccaccagcgc catgacggcc acgcccagag cgaggtacgg 10560 
tccgaagccg tgaagtgccg gggagacggc cacgagcagg gcaaacatgg cgagcacgat 10620 
ggtcgaggcg ctggcgagga cggactcggc ggtgcggcgc acggcggcct gcatcgcgcg 10680 
ggcgcggtct ggctcgtcga gcagggtctc gcggtagcgg gcggtgatga tcagcgcgta 10740 
gtccgtgccc accccgaaca gcagcacggt catgatcgag gcggtctggg agctgaccgt 10800 
gatgactccg gcgtccgcga gaatcgcgcc gagagtctcc gccacgcgca tagccacgcc 10860 
cacggcaaga agcggcacga gcgccatcag gggcgagcgg tagatcgcca gcaggatgat 10920 
caggacgagc acgacggtgg ccagcagcag gactttgtca ccgccgctga agaccttcac 10980 
ggtgtcggtg gcgatcccgg cggggccggt caccgcgacg tcggcgggcc cggcccggtc 11040
ggacgcgagg gcacgcacct cgtcgaccgc attctggaag gactcgtccg aggggctgcc 11100 
ctccatgggc acgatgacca gctgagcacc gcggtcctgc gagaccaact cggccgcagc 11160
gtcgggagcg gtcaccgtgg agaccacgct cacgacatgg tcgggrcggc tggttccgga 11220 
aagggccgag gtgatggcgg cgaccgattg cgtggcgctc ttcgcggcgt cggtgccctt 11280 
gccgcggacc acgatgatcg ccggcgtcgc gtcctggccc ggaagctggg cgcggacgag 11340 
atcacgggcc ttcatggagt ccgaggcggc gggcggcagg ttggcggagg cgttgtcctc 11400
gacggattcc agggccgggg cgaccccggc gaggaggccc gcgatcagga cccagaaggc 11460 
caccaccacg gcggcgcgct tcttcgatcc caggagacat cgcagcagag cgggggagtt 11520 
catcggttgc atcgggcagc cttcggcagg aagtacggac agaacttagc gacagggtgt 11580 
ctctaagttg cgtcaagcta acacgccccc tcggcctctc gggcgtgggg gtaggttggc 11640 
gggagacggc acagcgtccg aggtgaagcg gagaaaatgc ccaagattga agccggcagc 11700 
gtccgggagc accgggcgca gcggctcgcg cagctgattg acgcggccga ggagctcctg 11760
gaagagggcg gtgccgaagc cctcacagcc ggagcggttg ccgcgcgagc cgggatcgcc 11820 
cgcaacagca tctaccgcta cttcaactcc atcgacgacc tgctcgaact cgtcgtcacc 11880 
cgcgaattcc ccgcctggat cgacgcagtg gagcaggcca tcgcggccga gaccacaccc 11940 
gccgcccagg ctgccgccta cgtcagggcc aacctcgaac aggcagctcg cggcacccac 12000 
ggctggcggg ccgcgctcac gcgcgactcg ctctccccgt cggcgcggga gcgggtgagg 12060 
aatctgcaca tctcgctaca cgaggcgctc gcccgggtcg tgcgcgaact ggggcagcca 12120 
cagcccgagc tgaccgtggc ggtggtccaa gcagtcgtcg atgcgtgcat ccgcagaatc 12180 
gaccaaggcg acgatctgac aaccgtgtcc gacttcgcgg ccggagcgac gcgtcgactg 12240 
ctcgcggatg acgacttgcc acatcacccg tgacgcaccc cgtccaggcg gctcgcaggc 12300 
ccgtcgacag cgaagccccg gcagaacgag ccggatcttg agccgcaccg gagcgtgacg 12360
cagaccgctg gtggctcatg cctcgtctca tccgatcttg ccaccgggcg gccgaccggt 12420 
cagtgcccga cgcccatcga ttacgacgtc cacgacccga accagcgcgt tcagtgcgtt 12480 
gacgttcgtg gtgcgctcat tggtcacccg gcctctgggg gtcaccagcg cttttagggc 12540 
acgagactcg acggtggcgc gtgataccag gcaggcatca tgaccttatg gcgatgacac 12600 
tccggcttcc cgacgacctg gacacgaagc ttacggagcg ggctcgtggg gagggttgca 12660
gcaagcagga acttgccatc ggggccattc gtgatgcccg ggaccgggcc gagcrgaagg 12720 
tcgatgacgt tctggccggt ctgatggaca gcgatgcgga gattctggac tacctgaagt 12780 
gagcggcgtg cgctacctcc agatcgacga gatcctggcc atcgtgcgca cggtcaacgg 12840 
tgccgagcac agcgtgcgtg acatgggcct ccttgtgtcg gcgatcgaac ggccccggac 12900 
gaacgtcttc ggagccgagc tgtatcccac cctgcacgag aagccgcggc actactgcac 12960
tccgtcgccc gcaatcacgc gctgatcgac ggcaacaagc gcaccgcctg gttcgccatg 13020 
cgcgtcttcc tgcggttcaa cggcgccagc gccagtaccg tcccgcccca cgggcgccgg 13080 
cccgacggac ccgaggcccg tcacgcgctg ctcaccagca gccctctcct cagcagcgca 13140 
ctgggaccgg cgctgctgat cgccctgtcc gccctggggg ttctcgccct ggacacggcg 13200 
ttgtgggtct cggtggtcag tgaggtggcg gcgccggccc ggtggggctt cgtgggcggg 13260
ctgcgtgtcg gcgccgggcg tctgggagcc ctgatcgccg gcgtactcaa cgccgtgatc 13320 
ggtcttggcg tggtcgctgt caaactcatc gccgggcact gagagggcct gtggtggtgt 13380 
tcgcggagcg catacggtgg cagaccggtc ggaatcctcg gcgccgcggc cggagcggtc 13440
ccggcacccc ggcgaacagc cgcacgtccc cgtccggtcg ggtcaggtcc gagccgtcag 13500 
atccaggtca gtcgccacag gcgcagaagc ccggtgccgt ccaccgcgta ctggccgccg 13560
cccacgtcct ccccggacac cacgaagtcc ttggcctgcc acagcgggac gacgggcacg 13620
tcgcgggcga cgatccgctg aagggcttcg aggtcggctt cggcgtcgct ccggtcggcg 13680 
aagcgctgac tgctcgtgat cagccggtcg gcggccttgc tgccgtaccc cgtcgccatg 13740
gtgccgtccg tgccgacgag aggaccgccg aaggtgtcgg gatcggggta gtcggcgacc 13800 
cagccgacgg cgtaggcgtc gagctctccc tcggcccagc tcttctggaa ttcgtcccat 13860
tcatatcctc tgagggtcac cttgaacagc ccgtcggcct ctagttgctt tttcacctcc 13920 
gcagcctcct cgtgggctga tccgcgtccc gccgcgtaac cgtaggtgaa agacagcggg 13980 
atttcctcac cggcctcgac gaggaggcgg cgtgcctttt cggcgtcctt gtgagggtag 14040 
ttgtcgaaga aggaggtggt gtggcccgtg atgctcgtcg ggatgaggga gtagagcggg 14100 
tccacggttc cgtcgtagac gtcgtaggaa atccggtccc tgtctatcag ccaggccgcc 14160 
gcctgccgtg cgcgtctgtc gtgaaacggc ttgccgcggc ggttgttgag gtacaggttt 14220 
cgagtctccg cgctctgcgc ctccgtcacg cgaagccccg gatcgctcgg gttcagatcg 14280 
gcgagcattt cggggggaag ctgtctgagg gcgacatcga tgcggtggga tatccaggcc 14340 
cgggcgagtg agtcgggggt gtcgtagaag tggagttcga tcggccggcc ggtgttctcg 14400 
gcggcgccct tgtaccgagg gttgggcgag agggagatct tctcgccctt cgcgtaggag 14460 
acgacgccgt acggtccggt cccgtcgatc cggccgtccg agcgcaggga gtccgccggg 14520 
tacgtggtcg agtcgacgat cgagcccgcc ccggtcgtca gcttgaaggg gaacgtggcg 14580 
tccggtgcgg tcagtcggaa agtgacggtc cggtcgcggg cgtccatcga ctcgatggtg 14640
tccaggaggg acgacggccc cacgtcggaa tctatcttct tgacccgttc gaacgagaac 14700 
cggacgtcct tggctgtcat tctgcgtccg ctggagaagg tgatgtcatc ccgcagccgg 14760



catcgatagg tgcgtaggcc ggaatcggtg aaggagcagc tttcggctgc gtcgggaacg 14820
ggctccgcca ctccgggctc cagggtcagc agtgtctgga agacattgct gtacagagtg 14880
gtcgagccgg agtcgtagcc gccggccggg tcgagtgacg tcggcggttc cgtcgtcccg 14940
accttgatgg tggtgccctc ctggtcgtcc gtcgggtaaa gcagcagacc ggctgccagt 15000
gccgtggcga tcaccgtggg tgtgatgacg gacgcacgga tgtgcgcacg aataggtctc 15060
atgaggctcg tcctcgcaag atcgagacga acaggaattt tcgtacccct gggtggagag 15120
tgcgtcggcc aagtatgcgc aggcgtcgct tccttcggag cccgacggca cttccggaac 15180
gaagtcttat gactgacacg gtggaactgc tatgccccgt tcggcgagag ggccgccagg 15240
ggtcggcacc ccctctcagc agccgttccg cctcgtctcc ggtggtcctg cggacccgct 15300
tgcgcgggtc ccgcccacgg tctcactcct cgatgccatt ccctgtgcaa tgtcacctgt 15360
gccatgttcc gtgttgcagg gcgtggccat gccaagtcgg gaggtcgttc gtcttccgtc 15420
^ggtggcagt gcggtactcc gtttcccacg tcctctcccc cttcagtcgg ccgtgctccg 15480
cacggccgga tccctcatgg gaggcgctgt gagaaagtca ctggtacggc gaggtctggg 15540
ggcggcgctg ccgctggccc tgaccgtcgc catgagcgtg ggcctgctgt cgcagccggc 15600
cggcgcagcc gggaacaccg ggtccgtcgt gcacgtcgcg gcggacgacc cggagcacgc 15660
gggacccccg cccgtcgcgc agtcccccac cgccgagacg gagcacgtcg cgcagggacg 15720
cacgagggcg tccgagcttc cgcccgtggc cgcgagtaag gacgcgctca aggaggtgta 15780
cggcaagacc gcgaaggcgc cggtccgtcc ctcgaagtcg acggacaagg cggtcgccgg 15840
caagaccggc aactcccgtg cgcgtgccgc cgcgtgcaac gtctccgact tcaccagccg 15900
gagcggcggc gcgctggtcc agcagatcaa ggcgtccacg accgactgcg tcaacaccct 15960
gttcaacctg accgggaacg acgcctacta cgccttccgt gagtcgcaga tgaccrcggt 16020
cgcctacgcc ctgcgcgacg gctcgacgtc ctacccgggc aacgcctcca ccggtatgcc 16080
gcagctcgtg ctctacctgc gcgccggcta ctacgtgcac tactacaacg ccggcacggt 16140
gggcacctac ggcagcagcc tgcagaccgc gatacgcgcc gggatcgacg ccttcttcgc 16200
cagcccgcac tcccgcgacg tcaacgacgc caacggcgag acgctcgccg aggccgtcac 16260
gctcatcgac agcgccgagg agaacgcccg ctacatccac gtcgtcaagc gactgctggc 16320
ggactacgac tccacctgga actcgtcgtg gtggatgctc aacgcggtca acaacgtgta 16380
cacggtgacc ttccgcggtc accaggtgcc cgcgttcgtg agtgccgtgc agtctgaccc 16440
cggcctgatc gacgcgctct acaacttcgc gagcggccac ctcgcgctgc tgggaacgga 16500
ccagtcctac ctcacgtcga acgcgggacg tgaactcggc cggttcctgc agcattccgc 16560
actgcgctcc aaggtcagcc ctctggccgg cggcctgctc aactccagct ccatcaaggg 16620
ccggacggcc ccgctgtggg tcggtgtcgc cgagatgacc gactactacg acaaggccaa 16680
ctgctcctac tacggcacct gcgacctcca ggcacaactg gcccgctccg tcctgacggt 16740
gacctaccca tgcagctcca gcatcaccat caaggcgcag cagatgacct cgggcgagct 16800
gtcctccagc tgcagcagcc tgcgcaacca ggacgcctac ttccacaacg tggtccgtga 16860
caacggcccc gtcgcgaacg acaacaacag caccatcgag gtcgtggtct tcgactccag 16920
caccgactac cagacctacg ccggcgcgat gtacgggatc gacaccaaca acggcggcat 16980
gtacctggag gggaatccgt cggcggccgg caaccagccg cgcttcatcg cctacgaggc 17040
cgagtggctg cgtccggact tccagatctg gaacctcaac cacgagtaca cccactacct 17100
cgacggccgc ttcgacatgt acggcgactt caacgccaac atcaccaccc cgaccatctg 17160
gtgggtcgaa ggcttcgccg agtacgtctc ctactcctac cgcggcgtcc cctacaccga 17220



ggccacgacc gaggcggggc gtcgcacgta cgcgctgagc accctgttcg acaccacgta 17280
cagccacgac accacgcgca tctaccgctg gggctacctc gccgtgcggt acatgctcga 17340
aaaccaccgc gccgacatgg acaccgtcct cagccactac cgcgcgggaa accggaacgc 17400
cgcccgcagc tacctgaccg gcaccatcgg cacccgctac gacaacgact ggtacacctg 17460
gctggcggcc tgcgcggccg gcaactgcgg tggcgggggc accaacccgc ccgggaacca 17520
ggcgcccacc gccgcgttca ccaccgccgt ccagggcctg aacgtcacct tcaccgacca 17580
gtccaccgac gccgacggca gcatcgcctc ccgctcctgg agcttcggcg acggcaccac 17640
ctccacggcc accaaccccg tcaagacgta cgggtcggcc gggtcctaca cggtcaagct 17700
gaccgtcacc gacgacaagg gagccaccgc caccgccacg aggacggtca ccgtcggcag 17760
cggcggaggc ggcggcaccg aatgcaacgg gaccgacacc cgggaactgg gccagaactg 17820
ccaacgcggc aaccagtccg ccaccaccgg caactacgcc tacctgtacc tctacgtccc 17880
ggccggcacc acccagctga agatcaccac ctccggcggg acgggcgacg cggacctgta 17940
ctacagcacc agcggctggc ccggcaccac gagctacacg cagcgggcca cgggagccgg 18000
caacaaccac accctgacca tcaccaaccc gccggccggc gccaactaca tcagcctgca 18060
cgccgtcagc agcttcagcg gcgtcaccgt gagttccgcc tactgaccca cggctccgca 18120
ccaaggcacg accctcacga cggcccgggg cggctctccc cgccccgggc ggcgtccggg 18180
gcggcggcag gggggagacc tccgtcgccc cggaccgaga acacatcgcc cgcccgcaca 18240
cgggcatccc tacctcccag gaggcagagc gtgaagtcat tacccgcacg caggcgacgc 18300
cgcgccatgt ggtccctcat catgtccgtc ggtctcacct gcgcactcgc cacacccgcc 18360
gtcggcagcg gtgaccaggg cacgtcacgg ctcagcgcct cgcaacaggc cgcggccggc 18420
caactcgcag cggaccagca catctccacc caggaggcac agcggcgcgt actgcggcag 18480
gagcggctca ccggcgtcgc aacagcgctg cgtgagcgcc tgggttcccg cttcgcagga 18540
gcctggatcg accagaagca cggcggcagg ctgaccgtcg ccgtcacccg gtcgacggcc 18600 
acggccctcg tcgaggcccg gtccgctcag gctcaggcac ccgacacgac caccgtcgtc 18660
gtcgaccgca gcctgcggca actcgaccgc atgtccgcag gactggccca ccgtatcgcc 18720 
gcagcgaaca agggcgccgc ccacggcctg cagtccgcgg tggtggtgca ggacaacaag 18780 
gttcgtctgg acctgccacg gggcaagacc ctcacccccg cccagcacgc agtcgtggag 18840 
tgggcgaagc ggaccctcgg cgatggcctc gaggtcagca cctacgcgca tgcctccgaa 18900 
cccttctact gcggcggcca gtactcgtgc gaccccccgc tgcgctcggg cctggccatc 18960
tacggcacga acgtccgctg ctccagcgcc ttcatggcgt acagcggcag cagctactac 19020 
atgatgaccg ccggccactg tgcggaggac agctcgtact gggaggtccc cacctacagc 19080 
tacggctacc agggggtcgg tcacgtcgcc gactacacct tcggctacta cggcgactcc 19140 
gcgatcgtca gggtcgacga ccccggcttc tggcagccgc gcggctgggt ctacccctcg 19200 
acccgcatca ccaactggga ctacgactac gtcggccagt acgtgtgcaa gcagggctcc 19260 
acgaccggct acacctgcgg gcagatcacc gagaccaacg caacggtgtc ctacccaggc 19320 
cgcaccctga ccggcatgac ctggtccacc gcatgcgacg ctcccggtga cagcggcagc 19380 
ggcgtctacg acggctcaac ggcccacggc atcctcagcg gggggccgaa cagcggatgc 19440 
ggcatgatcc acgaaccgat cagccgagca ctggcggacc gcggggtcac gctgctggcc 19500 
ggctaagcag cccgggcgga ccgtgagtac gccgccccgg tcacatcacg aggacgtcga 19560 
ccgccgcacg cgcggtcggc gtctttcccc gtgctccgct ccgtccgcca cccagcggac 19620 
tgggggcggg ggcgtggcac gtcgtgcacg ccgcagcgcg gtggaacccg tcggccgatt 19680 



agaccgtacc ggggagcgcc tttccggctc cgttcgtggg acgggcgggt gcgtatgcgc 19740
gcgtcaccca tttctggaag tgcggagcct gcgacagcag ttgccagtgg gcgcgtacgg 19800
catgatggtg caccacctcg acggccgacg cctcgaccga atcccgccgc cagacgagca 19860
gatgccgctg ccacagcgga tcccccgcga ggggtttaac cagtactccg cccaccgggc 19920
gcatggtggg ctggacggcg gccaccgcca gacccttggc gatcatcgac tgcagttggt 19980
cgagcatgtg gaactcgtgg gtgacggcgg gcctgaatcc cgcggcccca caagcgtcgt 20040
agaaggcgcc gggccagccc accccgtcgt ccgcggagac gaaccacgcg tcctccgaca 20100
ggtcggccaa ggacacctcc agccggtgcg ccagtgggtg atcggcaggg gtggccacga 20160
acaccgggac ggttctgata gctcggtggt ccagcttcgg agagtgtcga agaggcagcc 20220
ctgggtagtc gcaacccagg gcgacgtcga gctcgccggc ctctaggaga tcgatgagtt 20280
ctccggtcgc gtacacactg ctgaccgaga cggtcagatc ggggcaggct tcacqgagga 20340
cgtcgagcaa ggtgggtacc accggtgtgt tgatggcccc gaggcgaagc cgacgtgtcg 20400
ccccggacga gcggggaggc cgcagccgtg cgagartgtc ggagagcgcc aggatctccc 20460
gggcccggcc gacgacctgg gcgccgtagg cggtgagctc cacgcccgcg ctgctgcgca 20520
ggaagacccc ctcgccgagc agtccctcga tgcggcgcag ttgggtactc atcgccggct 20580
gggtgtatcc gagcgccgca gcagcccggc cgacgccccc cgcgtcggct atcgcacaca 20640
gcacgcgcaa gtggcgcagc tcaagttcca cgggggcacc tcgctccggg cgaacagagt 20700
tccattatgc gccaggagga aggcggtggg gaatccggga cggcctgacg ccttcggtcg 20760
accagtagcc cgagggttat ggatgagccg gagcctctgg tatggcctgg ccggttgttc 20820
ccgggtgacc gccgtggaaa tctcggacct gcgtgttggt ccgcagaggc gactgcggaa 20880
gcctgaagcg caccgccatc gaggagcgac atcatgcctc acacctgcat cagcttcacc 20940
gtcgaagcga ccggggccgc ggttcaccgc gcccgccacc gcgtctccac cgcgctgagc 21000
tggtggggag ggccggtcga ggaagagctc cgcttcagcg cggaactcgt gacctccgag 21060 
ctcctcacca acgggctgcg gcacgcgggc gggcccatga ccgtcgagtt gacgctggtg 21120 
cacgacatgg tcgtcgtcgc ggtcctcgat gacagccggg agctgccgcg gcctcggcag 21180 
acggaggcgg acgacgagtg cgggcgggga ctcgccctga tcgaggacct cagtctgata 21240 
cggggagtcg agaccacttc ccgcgggaag cgctgctggg cggttctgcc gctgcggacg 21300 
ccacaggagc gggctatcga gtcggctccg gctgaggagg cggaccacgg cttcgaggca 21360 
gaccgggaac gctggtcact ggctccccaa ggaagcggac tactggcgag tctgtttccg 21420 
gcgatgtgag ttcgtcctcc tcgggcggcc cagtagccga cccagggcag gcgggcgtgc 21480 
ctgagggcgt gatgacgctc gtctgacgct ctggccgctt tcaagctgca cagcgagccg 21540 
agaaacagcc tttgacctgg ccttttctgc ggctgcctca ggccgacatc tttccgatga 21600 
cgcaccacgt ggagtacgtg gcgattctgg agcctgctgg caaggggttc tgacctgcgc 21660 
ttttgttctc ctgcggcggg cgcggcaagc tcgtgcgggg cagttgggtt tcccgaaggc 21720 
cggtgctcgt gtgtccggcc ggcgggtggg ctgccttcgt ttcagtgggt gcgagagggc 21780 
actcggacgc ctgagccgag atgcggttcg ttcggcacca tggggtccgc aggatgaccc 21840 
ggtcagcgac cgctggcacc tgtggaagaa cctttgcgac aaggccctgg ccgaggttcg 21900 
ctcccacagc gcctgctgga ccacagcgaa cacaccccgc ccggtcggcg tccatgagca 21960 
gaccacccgc gaacgttggc atcagctcca cgacctcctc ggcaagggtg tcggcttgct 22020 
cgaatgcgcc cgccgcctga acctgtccct caacaccgtc aagcgctacc cgcgcacccg 22080 
cgatcctgaa gccctgcgcc ccgtgaagca gctgtttcgc gaggtccagg agcagggctg 22140 
caccggcagc ttcaccctgc tctaccgcag cacccagggc cgggcagaag gcgaccggcc 22200
cgtcggaggg tcgcggcttg acgctcaccg tatccatcac tggaacggcg acgtctgatc 22260 
ccgtctgccc ggggcttggg tcccggctgc ggcccgtagg cccggctcac cccagcaccc 22320 
atcactgttc gagagtgatt acctctccgc cggacacatg gaaatctgca tcggctggag 22380 
tagacattgg gcagcagtgt ggttatgttt ctcctgtaac ccagaaggac cgcagggccc 22440
ggcagagacg aactgccggg cagcagtacc cgcagttgca ggacggtgcg gtggtggagt 22500 
gtcgaagcca ggatggtgca ggacggcgac gggactgacg accggaccgg gcggcccgca 22560 
gtggtcaggg gccgccaccg cagtgcagta cccagcagcg aagtcagtga gcggtacctc 22620
ggtgaaggcg tcggctgcgg acgcgcgcgc cgggaggttc ggcagtggtg gttccaagcc 22680 
agagcagacg caggacgggc aacggggccg actgtcggac agtggcgctg tcacaggtca 22740 
ctgagaggtt cgtgtcacca gcagtagagc agtaccagag gaaagaacgg aggaaccaag 22800 
cgccatcagg atcgcccggg cgcagttttg ggcccgggta ccgcaggaca tcgatagtga 22860
ggtggtctcc ggtcaagaaa ccgcgatccc cgcgcccccg gcagcaggca ggtcgggtcc 22920 
gcggacacag aaggccggtg cagtatcagg gccggcagat ggtgtaggag ttccttcggg 22980 
gccctggtgc cgcatggcac cagggcccct ccatgcgttc cgcagagagg tgcaqatgac 23040 
agcagacgat tcgtacggcc gtctcgacga cgacgattac cccgcctaca ccatggggcg 23100 
ggcggccgag atgctcggta cgacccccgc tttcctgcgg gccgtcggag aagcccggct 23160 
gatcacgccg ctccgctcgg agggcggcca ccgccgctac tcccgctacc agttgcgcat 23220 
cgcggcccgc gcccgcgaac tcgtcgacca gggcactccc gtcgaggcgg cctgccgcat 23280 
cgtcatcctg gaagaccagc tccaagaagc gcggcgtatc aacgaggaac tgcagaggcg 23340 
cccggccggc ctggtggaca aggccgaggg ctgaggccgc atctgccggc cggtcctgtg 23400 
agggctcgcc tgccaagacg ggaagccctt gccgcaacga gaagaggcaa ctgtccgcac 23460
cgatgtgctg ggcccggtcc tggctaggac tcccgtcttc ttgccggagc gatgcggctg 23520 
tggacgcgga accggacggc agtgtcgtcg ggcgcggagc gcggggcgca cgtcgatggc 23580 
gacaggaccg gcgaaggtgt attcgtgttc ggcggtgtga cggcgcacct ggccggcgag 23640
ggcggcggcg caggtgtcac agggacatcg gttccgactt ccaccacccg tccgggttcc 23700 
accagcgtgt catccacctg atccaggctg ccgcggtagg tgctcgacgt cgggggtgta 23760 
cgggggcagt tgtaccgttc cgccgcgagg agtgacccga ttgaccaccg gcctgtggcg 23820 
ctcaggaacg ggctggactg tcgcagtccg ggccaactca agcccgacca tgaggccgac 23880 
cacggcgccg cgcgaccccg accacagcta cacgcgtggc atgaccaagg cggcacatgc 23940 
ttcgaacgag ccatctcatg tgtgccggta tgaacgtgat cgacgtcccc ggcactctgg 24000 
tgcggacgca agccgtctgg ggcgccaccc acgactggct cgccgccccg cccccgcggc 24060 
gccaccgtcc gtcccgccct gcgtcgtcgc ccgtgtggcg tcatgacggc gacagacttc 24120 
ctcgcgtatg ggccgaccat acggccaacg ccagaggtaa agcgctgtcc atggtgagtt 24180 
ccctgaacag aagggctggc gggacctcct ttccaagacc gtgctgcagg agtccgtcag 24240 
agcgcaggta atcccgtgct gtccgcgacc cagggctgtc ctccgtctgg ccgagggtcc 24300 
tcgtcttctg ggcgacatcc ctttagcgtg ggcggtagcc gccgaaggga ggcgccatgt 24360 
cggacgaatt gacgggcccg ttgggaacgg caatgcggga ggtcacgttt ccggaccggt 24420 
ctcgcgggat catcttggtg cgggctggaa caccgcaggc cgaggccgag gcaatggccg 24480 
cccgtatgtg ggccgagatg ccggaaggct gacgtgcccg aacgcagaca acccgtaccg 24540 
tcctcacacg cattcccctg agccgtcggc catggaacgg aaccagccgt acgaaccccg 24600
gaggcgccgt tgcggtctct gcggcgaggc cggggccacg cagggcgaag aggccgcgcc 24660
gcgttctgcc gcctggcgcg gctgccggct gttcacgaga acaccgaggg aggagtcgcc 24720
cgcctcttgc ccggcgcgtt gccgggtgga gagcaggtgg tgaaggactg gctcgctgaa 24780
ggcggccgag gcgacctcgt cggccggcct gaacggcttt cactgtccca gcggcggcag 24840 
gccgccgaca caggcatgct ttgccatctc cctcgctgtc tactgacccc agcagcagga 24900
tccagtacgg cgtcgcggcg ctgccgcctc actcgcgcat cgatcgggga atgcggcatg 24960
tggtgagggc ccggccggcg tgccggtcgg gccctcacac tgttttggtg tcggcgcgtt 25020 
tgtcgtgtcg gtcagacgga caggtggggg gcgccgagca tggcggaagc ccgctgcaac 25080 
ggactgtcgc tgcgcgcggg ggcggcctcg ggaagggtgc ggcaggtgaa gcccagctgg 25140
gccatggccc ttaggatttc gccggtgctg aagtcgcggc ggtcctggcg ggtgatgacc 25200 
tggccgacct gcttggcggg gtagtggcgt cgtccgatga tcacggactc gccggtgacc 25260 
ggttcgggtt tgacgccctt catcgattcc agcacgccgc tcttggtcag gtcgaacggg 25320 
aagcgggcaa tgacacagcg catgatgcct cacaggcagg agagttacgg ggccggccgc 25380 
cgtctggcgg ttcagcggga gagagcgagg acgcccaggg cgctgccgtg ttcgtcgacc 25440 
acgggcacca gcccgagccg tccgaagggc accgcgtcct cggcttcctc cctcgtggcc 25500 
gacggtgaga cgaagggctc gctgtcgtcg gtgatgtcac cgaggcggag ccggtcggtg 25560 
tatcgggagc tgtcccggac ggcggtgagc cgggcctggg tgaccaggcc gacgcaccgg 25620 
gcatcctcgt cgcagacgac cagatgctcg gcacgggcgg cggccatcac ggacagcgcc 25680 
acctcgacgg tcatgtcgta ccagacctgt ggcccggcgg cgtccatgac gtcggccacc 25740 
gtgccgcgca atgggagagc gcctacggag cgatcctgca actgtcctgg cgtcaagggg 25800 
tgcctcctgc gcagacgggc ggggttcctg atcaggacgg tcctaggcgg ccgcgccagc 25860 
cgtggacttg agtgcggggg tacgccgcgt cgccgaggcg gggcggcgtc ggccgcgtga 25920
ggtggcgccg cgcttcttgg ggcgttcagt cgccggggcg gtgatgacga ccgggatgcc 25980 
ggtcggggcc tgggctccgg tgatccggct gagggcctcg tcgcccgggc tgacctgggt 26040 
ggtctgcggc cggatcccgg cttccgacat gagacggacc atgccgcggc gctggttcgg 26100 
ggtgacgagc gtgacgacgc tgccggactc gccggcgcgg gccgtgcggc cgccccggtg 26160 
gaggtagtcc ttgtggtcgg tcggcgggtc gacgttgacg acgaggtcga ggttgtcgac 26220 
gtggattccg cgtgccgcga cgttggtcgc caccagcacg gtgacgtgcc cggtcttgaa 26280 
ctgcgccaga gtgcgggtgc gctgcggctg ggacttgccg ccgtgcaggg cggcggcccg 26340
taccccgctg ttgagcaggt cccgggtcag tctgtcgacg gcgtgcttgg tgtcgaggaa 26400 
catgatcacg cggccgtcgc gtgcggcgat ctcggtggtg gccgcgtgct tgtcggcgcc 26460
gtggacatgg agtacgtggt gctccatcgt ggtgacggcg ccggccgagg ggtcgacgga 26520 
gtgcacgacg gggtcgctga ggtagcggcg tacgagcagg tcgacgttgc ggtcgagggt 26580 
ggcggagaac agcatgcgct ggccttcggg acgcacctgg tcgagcagtg cggtgacctg 26640 
cggcatgaag cccatatcgg ccatctggtc ggcctcgtcg aggacggtga cggagacctg 26700 
gttcaaccgg cagtcgccgc ggtcgatgag gtccttgaga cgtcccggag tggcgacgac 26760 
gacctcggcg ccaccacgca gcgccgacgc ctgcctgccg atcgacatcc cgcccaccac 26820 
cgtggccagc cgcagcttca cagagcgggc gtacggggtg agcgcgtcgg tgacctgctg 26880 
cgccagctca cgtgtcggta cgaggaccag ccccagcggc tgccgaggct cggcccgccg 26940 
gccggccgta cgggccagca gagccaggcc gaaggcgagg gtctttccgg aaccggtgcg 27000 
cccgcggccc atgatgtcgc ggccggcgag ggagttcggc agggtcgcgg cctggatcgg 27060 



gaacggcacg gtcacccctt gttggccgag cgcggccagc agttccccgg gcatgtcgag 27120
atcggcgaag ccctccgcag cgggaagcgc gggggtgatc gtccggggga gggcgaactc 27180
cccctgaacg gcgccgggcc ggcggccgta accgccggag cggctgggtc cggccggccg 27240
gcgcggcgcc ggcgaaccga agcggctgcc gccctttccg gagtcggcac cgccatgacg 27300
ggtgcgagcg aagcggtcgt tcgtgcgtgt gcggttcata cggaaccttc ctcgatgcgg 27360
cacatatcaa ggaatttccg aagcaatgag cagcacggag aatcgcaaga atggaccggt 27420
gggccttgcc agcggatctg gccgacagaa aatctgtgcg gcacgtgcgc tggaatgatt 27480
gggggtgctg tgggctcgat attcgaagcg tccactgcac tgtagctatg aaggatgcgg 27540
ctgcaccttc gaaggacgat ccgtgtgcgg taaacacacg ctgtccggag cgtcgtccgc 27600
aggtgaaatc actgcgggaa acgcatgtag ctggggcccg caccccgaag gatgcgggcc 27660
ccagctacaa gtacgtgaca gtcggcgtca ggcgggaacg atgttctcgg ccgtcgggcc 27720
cttctggccc tgcgcgatgt cgaagttcac cttctggcct tcgagcagct cgcggaagcc 27780
ctgggcggcg atgttcgagt agtgggcgaa cacatcagcg ccgccaccgt cctgctcgat 27840
gaagccgaag cccttttccg cgttgaacca cttcacggta ccagcagcca tgtcatttct 27900
ccttcggggc agtcgtacgg gatccgcacc gcgcggacct cgtgtcgccg caatgatcac 27960
cccgcccgga aaaagaccgg agatgtaaaa gtgcttccag gggtactgag cccgaccgga 28020
gcacttgaaa tttcgggaac cacaactgca actgacatcg acagtagcac gccacagcag 28080
ccactgtgcg gtgaagaacg ccaccttgct tattgcggca gagaatctat ccgcatgctc 28140
cgatgaaaac tcaaaccgcg cgcacagata ttgaccttcg cgcgacgcca tatatcgcat 28200
gccgcgctcg cgtgatccgg tcccccacca cgctctccgc tactgcacgg gtcgcaccgc 28260
cgcgggggca gacaggtccg gccatgacgc cggccatgct cggggcgtag cggacgcctg 28320



ccggtcgggt gtacgtctcg cgcgcggcga gcactgcggg ggaggggccg gttgccagac 28380
gtcttgcctg gcaaccggct gtcggctcgg gctggttggt cagccgtggc aggtgatgtg 28440
gttctgcgcg cccgcttccg tgaacgcgcc gcagccccgg ctgccttcta ccaggccgac 28500
cctcaggagg cgtgacccgg ggaagccgag gatcagcggt agtcgtcagg ggaggcttcc 28560
ttgccgccgt aggtgacgtc ctcgaagtat gcccaggcat ccggccggct gccgtccacg 28620
tccgtcaccc cgtatgccct ggccagttcc ccgctggagg tggacrtgcc gttccaccgc 28680
ttcgcgcggt ctgggtcggc ggccagcgcc gcgaccgtac gggccaggta gtgcggggac 28740
tccgcgatcg cgaacgtcgg ctcttgggcg atcgcgtcac gccagttctc ctcactcaca 28800
ccgaagtggg agagcatctg ctccgaacgc aggaagcccg gggacaccgc gaccgccgtg 28860
ccctcgtact ccgccagctc ctgagccagc ccgaacgcga ggcggatcgg ggcgttcttc 28920
gccaggtcgt agtagatgtt ctcgcggtag cggcggttgg agtgcgcggt accgtcggtg 28980
acttccacat gcagcggcgc gtcggagcgg atcagcagcg gaagcagcag cgccgccgtg 29040
atcacgtgcg agcgcgcgcc cagctccagg atccgcaggc cgtcggcgag cggtgtctcc 29100
cagctcttct tcccgaacac cgaggtggcc agaaggtgct cgccgcccca caggtcgttg 29160
acgagaatgt cgagccgctc gtactcccgg tcgatccgct cgacgagggc gcggacctgg 29220
gcttcgtcga gatggtcggt gggaactgcg attccggtgc cgcccgctgc ggtgacgagt 29280
tcggcggtct cctcgatggt ctcggtcgtc cggccgacct cgctggcccg ggcccgggtg 29340
gttcggccgg tcacatacac ggtagcgccg gcccgcccca gttccacagc ctgagctcgt 29400
cccgccccgc gggtagcgcc cgccacgagg gcgatccgtc ctgccagcgg acccttcgga 29460
ccggcctgct cggtgttctc agtggtctgc ctggtgatgt cctcgttgct catgtcatcc 29520



atcgttcacg ctaaaaccga cagaacacgt caccttttat gtggggggta ccgcgcatca 29580
tcccggccat agcgccaact acgtcctcgc actgagcgtt ttcagcgtgg gccaccgatc 29640
gggtgacgcc ggtcaggtcg gggtaggggc cgcaacgcac aaggctcgcg tgcacgacat 29700
ggccaccgcg cgcatgatct cccagcggga gcccagccgt ccccggcagc cccagccgct 29760
gagaccagct cacccgggac acccggtccg acaccgcaca cgatcaagta gtcgacctcc 29820
agacgcgttc agcagcccac atcccaggag ccgtctaccg tcccaggaac ccctgctccg 29880
ggaccatcgg gctcggcacc gggagtgcac agttgatcag taactggcaa cgagctcgtg 29940
cacggtaagc ggtgaggtgt cgaggtccag atgggcggcg gcggtggtgc ccccagcggt 30000
cggccgaccg gcatgccgag cgggcagccc accggtgtgc cgagcggcgg acccggcggc 30060
ggcacgggca tgggcggcac ccccaccccg cagcacctga agtcggtcag gaccggccgc 30120
gtgacgggct tcgggtcaga cctgtgcggg gaacagcagg cagtcgtccg ggcggatgat 30180
caggttgatc tcgccgtccg tgtgccggac ggggctctcg gcatggacgc gcacgtcgcc 30240
gatgctgagc tcgtactcga atcgcgctcc ggtgtacgag cactgctcga tcctcgcccg 30300
gagcacgttg acggcaccgt cgtgcggggc gtcggcgcgg tcggtgagcg tgatgcgttc 30360
cgagcgcagg cccacggtgg cggacgaccc cgcggagcag gcgccggcca ccctcaagcg 30420
ctgaccggtc tcacccagtt cgacctgtac ggctccgccc tcggtggcgc cgacgcgccc 30480
ctccaggagg ttgcagcggc cgatgaagcc ggcgacctcg ggagtggcgg gagtctcgta 30540
gatctcggtc ggtgtgccca cctgctggag gtgtccgtgc atgaacacgg cgatgcggtc 30600
ggacagggac atggcctcga cctggtcgtg ggtgacgtac acggtggtga tgccgacctc 30660
ccgctggagg tccttgagcc agacgcgggc ctggtcgcgc atcttcgcgt ccaggttgga 30720
gagcggttcg tccaggagca gcacgccggg ggagtagacg atgcctcggg cgagggcgac 30780
gcgctgctgc tgtccgccgg agagctggtg ggggtagcgg tcgcgcaggt gagccatgtc 30840
gaccttggtg aggacgtcgt cgatgaggcg ccgttgctcg cccttggtga ccttgcggag 30900 
cttcagcggc agtgcgaggt tgtcggcgac ggtcatgtgt ggccagagcg cgtacgactg 30960 
gaagaccagg ccgagattgc ggccttcggg gggcaccgtg ctgcgccggg tgccgtcgaa 31020 
gaagacctgg tcgccgacac ggatggtgcc cgagtcgggg gtctccagac ccgcgacgca 31080 
cgacaaggtg gtggacttgc cgoagcccga cgggccgagc agagtgaaga actccccgtc 31140 
cgcgacggtg aagttgacgt cctccaggac cgcggtcccg tggaaggact tcttgatgtt 31200 
ctcgacgacc agctcaggca tgcttcttcc ccttcaggag gagaccggcg aggccggcga 31260 
cgacggcggt gacggcgatc tggagggtgg cgagggcggc cacggagccg gtctcaccct 31320 
gggtccacag atcgatggcg gtggtgccga tgacctgtga ctcggctccg gcgaggaaca 31380 
tggcgggggc gtactcgcgg atcatctggg tccagatgag caggaacgag gcgagcatcg 31440 
cgggcacgag gagacggagc atgatccggg acaccgtgcg ccaccagtcg gcgccggcga 31500 
cgcgtgcggc gttgtcgagt tcggctccga gctgcatggt cgccggggag atcgcgccgt 31560 
.cgccgacgg gagtgcccgg atgccgaagg cgatgatcag cgcgaagagc gtgccgcgca 31620 
ccgcgtcgcc gccgggtatc caggtgaagg cccagaacag gccgatgccg acgatcaggc 31680 
ccgggaccgc gtgcggtgac tgcgctgtcg tctccaggag acgggcgaag cggaagtcgg 31740 
agcggcgtgc cacgaggacg accaccgtgc cgaacagggt cacggccacc gcccccacga 31800 
aggccacggt gatgctgttg acgatcgact cggtgtaggg ggcgtagtcg aagatcagac 31860 
ggaagttgtc cagggtgagc aggtcgaacg ggttcaccag cggagtgagc agcgaggtga 31920 
acgcgcgcag gatgagcgcg agcatcggca gcagtgcgcc gaagacgacg tacagaccga 31980 



cgaaggcgaa gcccagccac ttccaggcac cgatgtcgag caggtcggag cgggtcgcct 32040
tgccgcgcac cgacacgaac cgctgggcgt gccccagcag ccgcgtctgg aacacgacca 32100
gggcgatggt ggtgagcagc atgaaggtgg acgccgcgcc cagcaggccg tagtccggat 32160
tgatcgagtc gatgccctgc tcgtagagga agttggagaa gagggtgatg ccggcgggct 32220
cgcccaggat gagcgggatg gacagggtct cgatcgccgt gccgaagatc agcagacccg 32280
cgtagagcat cggcgggcgc agcatcggca ccacgaccga gcgcaggacg cgcagaggcc 32340
ccgcgccgac gctgcgggcc gcgttctcca gagaggtgtc ggaggcggcc agcgcgttgg 32400
cgcagaacag gtaggcgatg gggacctggg cgacggcctc gacgaacgcc ataccgggca 32460
gtgagtacag gttccagggc acccagccga agccctcgcg caccgcgccg gtcaggaagc 32520
cggccgggcc gtagacgacg atccacccga aggccaggac gagcggggag atgtagatgg 32580
gccagcgcag cacctgcccg aacaggcggg cggcggggaa gcgggtgcgc tccagcagaa 32640
tcgccatcgg caccgcgatg gcgagcgcga acacggtcgt caggacggcg aagaggaggg 32700
tgtcgaggac gatcgaaccg aagcccgccg acgtgaacag gtgggtgtag ttcgagaggg 32760
tgaaggcgcc gccggccgcg tacaggggct ggttgcggac cgactggtag aggatcggta 32820
cgacgggggc gaggacgagc acggcggtga cgaggaacgt cagccagtgg atggtgacct 32880
cacgtccggc gccgaacagg cgccggtact ggggcgtgcc cagctcgccc gcgcgcggga 32940
tgcgggacgg cgcgggtggc gccgggggtg tctggatggc catgacgact ccgtacgaac 33000
ggggtgggga caggggcgtt gggcgggcgg gggcggctca gccggccgcc ttctcccagc 33060
gcgcgacgta cgcctcccgc acgcgctccg gcacccgcac gggccggtac agarggacgc 33120
ggtccgcgcc gagcctgcgc cgcatgtcct gcagactgtc catggcgtcc tggcgcacgt 33180
ccggccggta cggcaccagg ccgccctcgg cgaccgccgc ctgcccttcg gcggagagca 33240






ggaagtccag gaagagacgg gccgcgttcg ggtgcggggc ggtcttcacg acggacagcg 33300
cgcgcggcat gacgacggtg ccctccgcgt agtagctcca ccccagcagt cccccgctgt 33360
gctcggcggc gggtatcgcg acgccatcac gtacggcagg atcccgccgt gccgcaggta 33420
ggccagttcc tggcgcgagt ggaggcgcag cctgagacgg accgtggcgc ggggtgcgtc 33480
gggccggacg agggacaggg cgacggggtt cgtgccgacg cacaggtcgg cgaggccgtc 33540
gaaggtgaat tcctcctctc cggtgaaggc gtgggccgat gcggtgtcgc cctcctcgaa 33600
ctccaggggc agtacaccca tgccgatcag gttgttgcgg tggatgcgct cgaaggactc 33660
ggctatcacg gcccgcactc ccagcagcgc ctgtgccttg gcggcccagt cgcggctgga 33720
gccggcgccg tagttgcggc ccgcgaccac gacgagatcg tggcccgcgg cgcggtaggt 33780
cgccgcggct tcgtggacgg gccccatccg cagttgcgtg ccccgatggc cgccctgcgc 33840
atggacgatg cggccgggtc ccttcgaggg cctggccgcc gggtcgttcc tgacccgagg 33900
cgccgatcag gacgacgccg tgcaacggac gcgaggacag cgtcagcttc gtccgcggca 33960
acagcgacga ccccggtgac ttccgatgac gcgcacgccc ccgccgcccg aacccgagct 34020
gaccgtcgac cgcgccgcct gctctgggtc acgctcccgc tccgcctgcg agatcagatc 34080
gccgacgcgc cgccgggcac cgtcgtccac gtcgtcgcca ccgacccccg cggcaccgct 34140
cgacctgcgc acctggtgcc acatgacagg tcacacctgt ctcggcacgc ccccggcgaa 34200
cggccggtgt acgccccgaa gctcaccgcc gacgcgcgcg ccacccgccc ggacgcaccc 34260
tggcacccgc tccggcggcg gcaggagcag ccccggaacc ggtgacgcat ctcgtcggcc 34320
ggccgtttcg agtggaccgc ggacgcggaa cgtcacggcg tccggaaacc ccggaaggtg 34380
accggcctgc gtgtcttgaa gccgagccgt tcgtacaagg cgatcgcgcc ggtgttcgcc 34440



tcggccacgt gcaggaaggg acgatcaccg cgcgccgaga tgcgctcggt gagagcgcgg 34500
acgaggcggg cggcataacc ctgcccgcgc gcctcgggag cggcgcagac ggcgctgatc 34560
tcggtccagc ccggaggacg caggcgttcc ccggccatcg ccaccagggt gccgtcgacc 34620
cggacaccca ggtaggtgcc gagttcatgg gtacggggcc agaacggccc cggctcggtc 34680
cgcgcggcga gatccagcat ctcaggcacg ctgtccgcgc ccagctcgac cacgtcggtg 34740
tcggacgcgg agcgagttcg gccggggcgg ccgtcgccgg gccaggtcat ctgacggccc 34800
tcaagactga aaaccggctc ccaacccggc ggcggaacgg ccggggagct gaacatgtcg 34860
gcgaaggcgc cgggaccgag taggccggcc aggtcggccc agtcctccgc gtccgggtcg 34920
acggacacgg aggagaaggt cgccacgtcg gtgagatagg tggctgctcg accgaaccgt 34980
cgggcgagat gagcgtgccg accactgagc gactgaccta ccgggtcgtc gagtgcgggg 35040
tcgtcgtcgt tcatcatcgt gccgtttcct tcctggtgag cgcggtggtc gaagggtggc 35100
cgcggtaggc gaaaagtcgg cggcggggcc cgtggcccga tagtcgtagc ccttgtcacc 35160
gtgcagtttg ccgggtcgcc tgaggacttc cggctggagg ccaatgccaa agcgctctcg 35220
tgccggcgga ggcacgcctt ctgacgtgcc tccaccggca ccactcagtt caggcagatt 35280
gagcttgagc gatgcagcgc cgccggaagt cgagcgcctc aatgcatcga gcggcgactt 35340
cctcgttctg ggtgagagca gtcctgcctt gtcgagtgat gcggcgttcg gaccgtcacc 35400
ccggcgaagg ccaggacctg tcccacggag tggctcatcc acctccccct cctcggccca 35460
cagcttcagg cccgacgtag ggggaggggc gactcggaac ccggcgtccc gctcgcgaag 35520
gtcggtcaga cctgttcgaa gtggaacgcc ttgatgaagc agtcccgggg ttgggcgacg 35580
gcgaagagga tgcactccac gagggactgc gcggtgaggg cgtcctcggc ttcgcgtgaa 35640
gtggtcgccc actcctcgga gagcgggtcg gcgttgtcga agtcgggcgg gtagagcgag 35700




atcacccgga ctccttgggc gcgcaggcgc ttggagagga tttcggtgaa ccctgcctgg 35760
gcgctcttgg ccgcgtagaa ggcgtcgtgt gcgtccgagc ggtggtggcc cggtgttccg 35820 
caggcggaga ccatcgtcac gacgtcgggt gtgtccgagt tgagcaggag ggggaggaaa 35880 
ctcctcgtgg tcaggaccgt gccggtggct ccggaggcga tggtgtccac gacgtcggcg 35940 
tcggttgccg acagcaggtc cggcccggtg aggtagcggg agccgttgtt gacgagtacg 36000
tcgacgcggt cggtgtgttc cgcgacgccg gaggcgaagt cgcggatcga ggcaggatcc 36060 
gtcaggtcgc aggcgaaggc gtgcacccgc tggtgtccgc ggtcgcggat ctcgtcgcgg 36120
acccgttggg cggcggcgag ccggcgtgcc gagaggaaga cctccgcgcc gaggtccgcg 36180 
aggcggatgg ccagggttcg tccgaagtcc cggccggcgg ccgtgatgac gacgcggtgg 36240 
ttgtcccatc tcatggtgtc gttccccagt cgccgtttcg tggatcgggt ggtgccgtgc 36300 
accgcgtctc tacgctatcg gtcatggtcg ctcacgaacg gtcgttcacg gtcaatgatg 36360 
atgttgaggt gcccaacccc ggtgcggacg aggtctggac cgtcggcgcg gtcatcctca 36420 
atcgggaagg tcgtgccttt gcccagaagc ggagccggga ccgtcgcctg ttccccgggg 36480 
cctgggacat cgtgggcggt catgtcgagg agggcgagac gcttctggag gccctcgcgc 36540 
gtgaagtcga ggaggagacc ggctggcgcc tgacccgtgt gcggcggttc ctcggcacca 36600 
cgacctggac gggggacgac ggcggcggcc tgcgtcacga ggccgactac ctggtcgagg 36660 
tggacggcga cctggaccac ccgaggctgg aatggtccaa gcactccgcc tacgactggt 36720 
tcggccccgg cgatctcacc cgcctcaagg agaaccgcgg accaggggag tacctgatcc 36780 
acgacctcat agccggtgcc gttgccgact cgcctttcga cttgctccgg gcggacgccc 36840 
tcaccagccc ggaccggctg cgcgagctct acccgcagcc gaacccgaac tcgctgcgca 36900 



aggagaccga ccgcctgacc gaggagaccc gggcgctgat cggctgttcg tcactggtgt 36960
tcatcggcag cgcggaccgc gagggccggg cggacgtgac gccacgtggc ggcccggccg 37020
ggttcgtctc ggtgctggac gagcagaccc tggtgatccg cgacgcgacc ggcaacaaac 37080
ggctcgacac cctgcacaac gtgctggaga ccggacgcct ggggctgctc ttcctcgtcc 37140
ccggccgcgc gaccacgctg cggatcaacg gacgcgcctg tgtttcggcc cgcccggagc 37200
tgctcgcccg cctcactccc gtcggaaagc cgccggtcac cgcgctggtg gtgcaggtcg 37260
agcaggtgta tccgcactgc ccgaagtcac tgatgcgcgc cgacgcctgg cgacccgagc 37320
agtggatgcc cgccgacgcc cagccgagca gcgccgaggt gacccttgcg cagctgaacc 37380
tgcccggcct gaccctggac cggatcgagg atgccgaacg ggagtcgctg cgcctgcggt 37440
acgaatgacg acgagtcgat gagcgccgat gagccgatga gacccgacgg gatccgacgg 37500
gtcggcgtcc gcggcgagca gaccggtcgc gaaggtcacc gcccgcacgg cggcgaccct 37560
cgcgacggtc agtactgtcc ggtcaggtgc gggtccagcg ttggttgctg ccgttggagc 37620
aggtgtacag ctggatcagg gtgccgttgg ccgtgccgtt cccgacggcg tcgaggcaga 37680
ggccggactg gacgccgacg acggacccgt cggagttgag gcgccacttc tggttgtcgc 37740
cgccccagca gctgtagatc tggaccttgg agccgttgcc ggtgcctgcg gcgtccaggc 37800
acttgtcgcc gtagaccctg agctcgcccg cgtcagtggc ggcccactgc tggttggtgc 37860
cgctgtggca gtcccacagc tggagctggg tgccgtcgga ggtgctggcg tcgggcacgt 37920
cgaggcagcg gcccgaaccg acgcccttga tctgtccccc gtccgcgggg ggctccgagg 37980
agtcgccgcc gttgagtgcg tcgaggacgg cggtgtacgc ggccttcttg ctgccgtcgt 38040
tgttgaacag caacggcgtc tgctccgacc gccaggagtc gctgtcgcgc acaccccaga 38100
cggtgatgcc gaggcagcgc gagacggcca ggcagtcgtt ggtcacgttg gcgtaggtcg 38160



-141- 

aggccggggc gccctggatg tccagctcgg tgatggccac gtcgacgccg agggcggcga 38220 
agttctgcag tgtggtgcgg aagttgctgt tgtaggggct gccgctgttg aagtgcgact 38280 
ggaagccgac gcagtcgatc ggcacgccgc gctgcttgaa gtcccgcacc atgttgtaca 38340 
tggcctgggt ctrggcccag gtccagttct cgacgttgta gtcgttgtag cagagcttgg 38400 
cggacgggtc ggcggcgcgc gcggtgcgga aggcgacctc gatccagtcg ttgccgctgc 38460
gttgcaggtt ggagtcccgc cgcgctcccg aactgccgtc ggcgaaggcc tcgttcacga 38520 
cgtcccactg gacgatcttg cccttgtagt gggccatcac gccgttgatg tggtcgatca 38580 
tcgcctggcg cagcgcgctg ccgctgaggc tctgcatcca gccgggctgc tgggagtgcc 38640
aggccagggt gtggccgcgc acctgcttgc cgttctgcac cgcccagttg tagacgcggt 38700 
cggcggagct gaagttgaac tggccccgct gcgg 38734 

<210> 31 
<211> 3331 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 31 

tcggatctcc ccacacaaca tagatagagg atatccgcct gggttcacaa tgaagttact 60 
ggtggttctc accaccctcg tgggctttag ctcagcacta agtttcggtt gtaattacag 120 
accagtatta ggcttcaatt cacagtatat gctgggagga ctaagacttt tctgtatgcc 180 
tgccatggtt tatgatccat gggcatgtgg ttgcgtttcg gcatggagca gtgcaggtct 240 
ttacggtgtc ggagggggcg gaggcgcctg gggagctggc ggtgctggag gagccgacgg 300 



cggacgcggc ggcggcggtg gagattggga atatgactat gatgacgaca gcgatgacga 360
tgatgaatgg gactgggatg atgacggtgg aatgggagct ggcgccggag gtggtgctgg 420
tggtggtgcc ggaggtggtg ctggtgctgg tgctggagca ggcgcaggag caggagcagg 480
tgctggactc ggacttggat tgggcggagg tctcggaggt ggacttggcg gacttggagg 540
tcttggcgga cttggcggtg gagacgattt attrgattta gatttcgatg atcttggtgc 600
agctcttgcc ctcggtggag ctggtggagc tggaggtgct gctgctgctg ctgcagctgc 660
cgctgctgcc gccgggggtg gagttggtgg agctgctgcc gcagccgcag ccgctgctgc 720
cgctgcagga ggaggcgcag gtagacttgg aggagctgct gctgcagccg cagccgctgc 780
tgccgctgca ggaggcgcag gtggacttgg aggactcggt ggcggacttg gaggactcgg 840
tggcggactt ggaggcctcg gaggtcttgg tggcctcgga ggatatggag gatctgctgc 900
tgccgctgct gctgctgccg ccgctgctgc cggaggtgga ggactcggtg gtgttggttt 960
ctacggtgga cgaggaggta gacgcggtcg aggaagagga ggccgcagac gtgctgctgc 1020
tgccgctgct gcagctgccg ccgcagccgc tggtggtggc ggaggaggtg gaggtggtgg 1080
aggaggaggc ggaggcgctg gtgctgccgc tgccgctgca gccgctgctg catctgcttc 1140
agcttctaga caaatgagtg gtataaggga cgcattagga gacattaaag accttctcag 1200
gagtaatgga gcctctgcaa aagcctctgc taaagcatca gcagtagcaa gcacaaaatc 1260
tcaaattgac gatttgaagg atgtcttaaa ggatcttgca ggtctattga aaagcccagc 1320
atctgcttca gcatctgcat ctgcatcagc ttcagctgga ggtggaggcg gtggtggtaa 1380
cggaggtggt aacggaggag gaggcggcgg tggagctgga gctctagctg ctgctctcgc 1440
tgctgcagga gccggaggtg gacttggagg tggaggcgga ggcggagctt tagccgctgc 1500
actagctgct gctggtgcag gtggaggagg ttttggtgga cttggaggac taggcggtct 1560

tggtggggga tctgccgcag ctgctgcagc cgctgccgct gctgcatcag gtggtggagg 1620 
aagagcactt agaagggctt tgagaagaca aatgcgtgga ggtggatccg ctgctgccgc 1680 
tgctgctgct gctgcagctg ctgctggagg tggatgggga ggtggaatgg gtggaggatt 1740 
cggagtaggt ctcggtggag gattcggagg aggatttggt ggtggatcat cagcagcagc 1800 
tgctgccgct gctgcagccg ccgctggatt tggtggaggt ggacgaagag gtagaggtag 1860
aggacgtgga ggcgatggcg acggtaacgg agctagtgct gtagctgcag ccgccgccgc 1920 
tgctgctgct gctggaggat ctgctgctga tgttgccgct gccgctgctg cagccgcagc 1980 
tatgtacggt gacggtgctg atggacctga tttcgataat ggattcggtg gtggaaacgg 2040 
aaatggaggt ggcggatctg gtggtggagg atccggcgga ggtggatccg gtggcggatc 2100 
tggaggtggc ggtggatctg gtggatcagg cggtggcggc ggatctggtg gttcaggcgg 2160 
tggcggatca ggcggcggtg gaaacaatgg atggggaaat aacggcaaca ataaatatga 2220 
cgatgatgac tgtgatgaat atggtaaccc tattagaagg gggtaaatta tttgacatta 2280 
tccgccattt gactcatttt tcttagttct ctatgtttta tacttcacct tagattgttt 2340 
tagtttgatt gaataaatta tgttttcgat ataaattttt tttaaattaa attaaacttr 2400 
attagttgac ctgtaaactt tttcatggag ttataatcta aggaacaaaa aacatacata 2460 
atatgttcag tattgtggta aagcacctgt accgcaaaca caatcacctc tatacatgta 2520 
tacaaaatca gtaatgctga caaaatcrtc tacactctca cctacacact cgcacacagt 2580 
cctcttacat acacagcact ataatatcct gaacatgaag tttgtgttga taaaaagttc 2640 
agaaaaatct cccctacatc acctgatctt tcactgaaaa tttacgacaa gtattgaaaa 2700 
tagcagaaag aaaacgggaa attgagaagt tttctataaa aaacaatcgg aacaatgact 2760




ggaatgacaa ggatgaaaat aatgataact tacattaatt aaggccccaa taatctctct 2820 
attttcaaac ttttttttca aatgttctct ctaactcact tgcatctatg tggaaattca 2880 
catactatac taaattacca caagtatcaa ggtttcacaa cctctcatgc cttcatggca 2940 
gaccatgctg ggtatttgtc taacaatgcc tcataaatac ataaaactaa ctaacaaaat 3000 
aggtcagtct gtaacaaatt attaatgcac cattattgca ttttctaaaa caaagcatac 3060 
actggatatt ggcagacaaa atgttgttat tggatacctt tccattctat cragacactt 3120 
gctttccaca agtcatcata aataaatccc ccctatccca aatgtcaatg gaatgcccca 3180 
acccttcccc cataatttta aaacctagaa taaattaaaa catctatagt tcgtcatgat 3240 
catctttctt atcatcctct tcttcttcct cctcctcctt cttcttcttc ctcctcctca 3300 
ggttcttggc tgcctgctcc ttccttgcca a 3331 

<210> 32 

<211> 5224 

<212> DNA 

<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 32 

ggatcccctg ctcgacgccg gcggcccggt acacctccat cgtgcggacg ttgttcccgc 60 
gcccgcgagg gtggacggag gtgccggcgt gacgctccac cagcatgtgc cgcaccccga 120
gccggcccag gaacacggac gtcgacaggc ccacgagcga tccgccgacg acgaggaccg 180
gaaccctgtg gaccgtgtcc ccggcccgat cggctcttgc gttcatcttt ctcctccagc 240 
gcgtgatgtc cgcccactcg gccggtttcg gccggggtca tgcatgcccc gcgaqgctgg 300 
agcgcggtgc gccggggacc acacttcacc cgcttaaccc gctgcgttcg cgcaggggca 360 



cggcacgccc gacgatcgtg ctcacgggcc gacgcaccgt catgtgacgc gtcggccgcc 420
ttaccgttcc tccaggaaga ggtgcgcctc aatgacggtc tctgccgctg tgtccacggt 480
cccggaccgt gtccccctca ccgtgttcga cggttcccgg gtgcgggtcg tgctgatgct 540
ggacatccgc gacgggacgc aagcggaggt cctggacgcc tacgagcgga tgtccgaccg 600
ggtcgccgcc gtgccggggc acatcagcga ccagctgtgc cagtcgctgg agaaccccac 660
ccagtggctc atcaccagcg agtgggagag cgcaccggag ttcctcgcct gggccaacag 720
cgaggaacac ctggagatgg tccgtcccct ggagccctac gtccgcggca cccactcgat 780
gcgctactcg gtgctgcgcg agacggccga ggagcgggcc ggggcgggtg cggcggcccg 840
gggcgcgctg cagccccggc cgcgcatcgg cgacaacgtg gtccggcacg ccgtcaccta 900
caccgtcaag cccgacagcg tcaccgaggt cgtgaagatc ctctccgcct acacctcgcc 960
cgaggtgcgc gtggacgaca ccacgcggct cgtgcgcacc tccctcttcc tgtacggcaa 1020
ccgggtcgtc cgggcgatcg aggtgcgggg cgacctgcag gccgccctgc gccacgtggc 1080
ccggcagccg gaggtgcgcg ccgtcgagga agccctcacc ccgcacatcg aacaggaccg 1140
ggacctcacc gacccgcggt ccgcccggct gttcttcacc cgggccgcgc tgccggccgt 1200
ccaccacgtg gtgtccgggc gcgggacggg cggcgacacg cagcggtgcg cgctgtacta 1260
cccggcccac cccggcgccg gaccggcgct cgcccggctg ctggcgcggc agggcgaggc 1320
caccgtgggc gacccgggca gtccggtcgt cgcctgcacc gtcttccacc gcgacgacct 1380
cgtcgtacgg ctcgtcgaca cggcgggcgc accggagcgc gcgcccgggg ccgtcctggc 1440
cctgcacgag ccggacgccc tcgccgaggc cgggcggctg ctggacgccg ccgcgctcgg 1500
cgccgacggc cccccggacg accgggcgct gccgacgttc ctcgcgcacg cccggatgcg 1560



gcctctgaca gaccgtcagt cgccggcctc ctgacccccc gctcgcccga cctcagggag 1620
tgaccgacat gacagaacag caggcacgca tcgtcgcctt cgacgacgtc ccgcccaacc 1680
ggcggcgcgg cggcgacgtc cgggccctgc tcacgcccac gaccgcgggg gcgaccagcg 1740
gcttcatggg cgtggccgtc gtacggcccg gagaacgcat ctccgagcac taccacccgt 1800
actccgagga gttcgtgtac gtcaccgccg gcgccttcga ggtggacctg gacgacgtgc 1860
cgcatcccct gcgcaccggg cagggcctgc tcatccccaa ggacgtgcgc caccgcttcc 1920
gcaacaccgg cgacgtcgag gcgcgcctcg tcttccacct gggtccgctg gccccccggc 1980
cggacctcgg gcacgtcgac accgaggaga ccgacgagac cgcgccggcc ggggtggtgt 2040
catgagccgc cgggtcgtcg tcaccggcat aggcgtcgtc gccccgggcg gcatcggcgc 2100
ggcccggttc tgggacctgc tggccggcgg gcgtacggcg acgcgccgga tctccctgtt 2160
cgacccggcg cgcctgcgct cgcagatcgc cgccgagtgc gacttcgacc cgtccgcgca 2220
cggcctggac gacgagacgg tccggcggtg cgaccggtac gtgcagttcg cgctggtcgc 2280
caccgccgag gcggtccgcg acgcgggcct ggacaccacg cgcgaggacc cctggcgcat 2340
gggggccgtc ctcggcacgg cggtcggcgg caccacccgc ctggagcacg actacgtcct 2400
ggtcagcgag ggcggctcgc gctgggacgt ggaccaccgg cgggccgagc cgcacctgca 2460
ccgcgccttc gcccccagca cgctcgcctc caccgtcgcc gagaccttcg gcgcgcaggg 2520
cccggtgcag accgtctcca ccggctgcac gtccgggctg gacgcggtgg ggtacgccta 2580
ccacgccatc gccgagggcc gtgccgacgt gtgcctggcg ggcgcctcgg actcgccgat 2640
atcgccgatc accatggcgt gcttcgacgc catcaaggcg acctcgccca gcaacgacga 2700
cccggagcac gcctcccgcc ccttcgacgc ccgccgcaac gggttcgtga tgggcgaggg 2760
cggcgcggtg ctcgtgctgg aggagctgga gcacgcccgg gcccgcggcg cggacgtcta 2820



ctgcgagctc gccggctacg ccaccttcgg caacgcccac cacatgaccg ggctcacccg 2880 
ggagggcctg gagatggcgc gggccatcga caccgcgctg gacatggccc gcctggacgg 2940
cacggacatc gactacgtca acgcgcacgg ctccggcacc cagcagaacg accggcacga 3000 
gaccgcggcg gtcaagcggt cgctgggcga gcacgcgtac cggaccccga tgagctcgat 3060 
caagtcgatg gtgggccact cgctcggcgc gatcggctcg atcgaggtcg tcgcctgcgt 3120 
cctcgccctg gcgcaccagg tggtgccgcc cacggccaac tacgagacac cggaccccga 3180 
gtgcgacctg gactacgtgc cgcgcgaggc acgcgagcgg gagctgcgca gcgtgctgtc 3240 
ggtgggcagc ggcttcggcg gcttccagtc cgcggtcgtg ctgaccggac cggagaggag 3300 
gctgagatga gcgcaccccg gcgagccgtc gtcaccggac tcggagtggt ggcaccccac 3360
ggcatcggtg ccgagacgtt ctggaagacg gccgtggacg gcaccagcag cctggcccgg 3420 
atcgaccggg agggctgcgg ccacctgccc ctgaagatcg ccggccaggt ccccgacttc 3480 
gacccggccg ccctgatcga ggacacctac ctcgtccaga ccgaccgctt cacccacttc 3540
gcgatggcgg ccacccagct cgccctcgac gacgcccggc tctcccgcgc cgacatcgac 3600 
tcgccgtact cggtgggcgt ggtgacggcc gcgggctccg gcggcggcga gttcggccag 3660 
cgcgagctgc agaaactgtg gggccagggc tcgaagtacg tcggccccta ccagtcgatc 3720 
gcctggttct acgcggcgag caccggccag atctccatcc gcggcggctt caagggcccc 3780
tgcggcgtgg tggccgccga cgaggccggc ggcctggacg ccctcgcgca cgccgcgctg 3840 
gcggtacggc gcggcaccgc caccgtcgtc gccggcgcga ccgaggcccc gctggccccg 3900 
tactcgatgg tctgccagct gggttacccg gagctcagcc gcagcgccga cccgggccgg 3960 
gcctaccgtc ccttcacctc cgccgcctgc gggttcgtgc ccgccgaggg cggggcgatg 4020 



ttcgtcctgg aggaggaggg cgcggcacgc gagcgcggcg ccgacgcgcg ggcgacggtg 4080
gccggccacg cggccacgtt caccggcgcc tcccgctggg aggagtccag ggccggcctg 4140
gcgcacgcga tcggcacggc gctggcgcgg gccggctgcc gtccgcagga cgtggacgtc 4200
gtgttcgccg acgccctcgg cgtgccggag gccgaccggg ccgaggccct ggccctggcc 4260
gacgcgctcg gcccgcacgc gcggcgggtc cccgtcaccg ccccgaaggc gggcatcggc 4320
cgggcgttct gcgcggccgc ggtgctcgac gtggcgaccg cgctgctcgc catggagcac 4380
gagctgatcc cgcccacccc ccatgtgctc gacgtctgcc acgacctgga cctggtggtc 4440
ggccgggcgc gtcccgcccg gccgcgcacc gcgctggtgc tcagccgcgg actcatgggc 4500
aacaactcgg cgctcgtcct gcgcaggggc gccgcgccgt tccccgagta agtaccccga 4560
acaggtgtct cacgtcccct tcgggcgcgg gcacccgagt caaggagctc aaccacatga 4620
ccgacatgac cgaacgcgtg ggcacccagg tgaccttcga ggaactgtcc gccctgatga 4680
agcgcaccgc gggcgtgcac gtggaaccgc ctgacctgcg ggcgcgggcc gaggagggct 4740
tcgacggctt cggcctggac tccctgggcc tgctgggcat cgtggccgag ctggagaaga 4800
agcacggcgt gggactgccg gagcaggtgg agcgctgcaa gacgcccgcg gagttcctcg 4860
cgcaggtgaa cgccaccctc aggacggcgg tgtgacatgg ccgggcacac cgagaacgag 4920
atcgtcatcg ccgcgccgct ggacctggtc tgggacatga ccaacgacgt cgagaactgg 4980
ccgcggctgt tcagcgagta cgcctccgcc gagatcctgg agcgcgaggg cgaccgcgtc 5040
cgcttccggc tcaccatgca cccggacgac gagggccggg tgtggagctg ggtctccgaa 5100
cgcgtcgccg accgcgcctc cctgacggtc cgcgcccacc gcgtggagac cggccccttc 5160
cagttcatgg acatccagtg ggtgtacgag cagacgcccg agggcgtgct gatgcgctgg 5220
atcc 5224

<210> 33
<211> 30601 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 33 

gatcttagac cttattcact tgatacgtgt aatagttatt acgatagtat gtttttggcc 60 

gattcctccg cgtcttcttt cgacgacgtg gaggtggagg caaaagcgaa gtagttgtgg 120 

aagaataaga attatgatta tcatgattat tattcaaatt aactctattg ttacgtaccg 180 

cgctccatgc agacgtttgc caggagacga cgggtggaag gataggaagc gaagaagcgg 240 

aagcggaaga cgtcgtattt gaattcgaag atgataatga tgtcatrgat gctgatgatg 300 

ttttgttgtg ataatgagat cggcatggag gcatttcaca atctctttgt tcgcgaggct 360 

taatctctga gcactcgata tcgtcttgtt tacgaccgga aagcacgtct tcacaccaga 420 

ttttgcggcg ttgaactccc ccgccacacg atacagaaca ctggaattar tattagaagc 480 

ttcaatgatg ttctaagaac ttacgtgagt ccatggagat atctgccaag aattatttgt 540 

gagtggtgga catagttctt cattgcattt atcaaacatc tttggttttc tggtttcatc 600 

gcaagaagac gaagtgcaag atacactgcg acgacgccat cccccaccac aagaagcaga 660 

gcactgaaca atcgtacatt agtaaattct aaatctgaaa attatatatc ccacttttga 720 

ccaatctcca ataatccagg atccaatatg ttcttctccc tttggacagg gttcaagtcg 780 

gcaatttctt gcacttgttg gtctcttttg cacatcacaa tcaacatctt tcaaaatcgt 840 
ccgaccaccg tcttccgcac tgacgcatgt aacatttcta ctttgttgaa catgagttcc 900 



acaagtagct ggacactctt cccattccgc cattttccag tatgaacaat cacgaaggcg 960
acaatttctt gttgatactt ccttatccaa atgattgcaa tactcatcag gaagatcacg 1020
aacatgatct cggcacttga gaagacgacg ttgagtacca ttaccacaag ttgctgaaca 1080
ggctgtccat ggtccggttg cccatcggat tggtggtacg tcggcttgaa gtttttgaag 1140
tactctgggc ccatcacaag tatctttttc acaagtcttt tttaggcgtg gacgagtatt 1200
ctgaaatgat attttcgttc aagattaaaa gcaaactgaa acgtactcga tcacaaaaat 1260
attcatcaac aatagttcct tcagatccac gagtacacga aacacttcta gtctgaattc 1320
ctgatccaca agtgactgag caaggggacc aatgacttgg tttccaagat gtacatggca 1380
gaagatggca agtttgacta gtttctggca ttttggtatc tccacaaaaa gaagcatcaa 1440
cagattgttc gcggtatatg cattcggttg tacgttcccg atgaccgatt ccacaagata 1500
cagaacactg aaacgtattt atggtattga caacagcaat tctggagtat ttgaataaac 1560
ttacagcact ccaatcagta tttctccaaa atgggcaagt gcctaaatta catgtctgat 1620
gagaagcagg ccgatcagat gcagtaccac aaagtgacat atcgacttca gttccatttc 1680
cagaaacaca tgaaactctt cttgacgacc atccatcctc acaagaaaca ctacactgag 1740
accattctcc aagtttatat tttggacatg attctctatg acatggtttt gtaattatct 1800
tttccatttt aaggcaacga tgttccggta gtactgatct atgacgatcg gtacaattag 1860
cgtctcgata ctgtacacca tctccacact tagctgagca gtcagaccag actccgaact 1920
gccaccaagt acaagcatgt tcattacaat gttcttgtgt ctgtgctgga ccacatctgg 1980
atgtatgtgt ttcccgatcg gctgcatcca aacattgagc atgacgcatt ttgactccac 2040
catcacaact tcgagagcac tctgaccaat gcccataaac ccatcttgga catggaattc 2100
tgttacattc ccgttctgtc gcctctttct gttctctgcc gcacaatgac tcatcaactc 2160
gacgattcga atcatcaacg caatatgact tccgatgcat ttttccattc gatccgcaag 2220
tttcagaaca tgaagtccat tctccatagt tccattttct tccagagcag tcaatgtaac 2280 
aactggcaat atcggatggt tttgaattac gatcacatag atgttcggat gctggagttt 2340 
gacgatcacc ctccattttt acgcaagaaa ctcgttgacg tttctgtcca gatccacatt 2400 
tggcactaca actagacaca tcttcagtga tccatctgta aatataaaaa tttattatag 2460
aaatctaatg aaaatatgta gtttaccttg tagaacaatc tatattgcac attcgtgttg 2520 
cttgttttgg tttgagaaca ttttgacaat ttctatcatg actttgacga tgagtcgaca 2580 
tgtccagaca cattaatttt tgcgattgct gtccacgaca ggctctatca cattctgtcc 2640
aagtatccgt aactctccac aaatacaatg cactggatat tggccgaatt acagcatttg 2700 
gaacagccgc agtcatgtac tcatatgaga tgtcgggtgg atgactacca acagaaagaa 2760
catgaacata aatgtcactt ctaatcggac cagttccatt tatccgttca ataattgcat 2820 
cagaaccaga atattcgaga acagtgtctt ggaatgcaat ttgttggcga gccagtgata 2880 
cttggaaatg accgttaagt aggaattcac cattggcggc acggagagct gaaggttaaa 2940 
ataaagattt tcatgggtat tgataacaca agggtgagtg atgaaaaaag taaatgttcc 3000 
aaaaacactt tgtatagaaa ctcacaaaga taattgtcat cttctttcat attattatat 3060 
cctttctgcc ggatatcaat atttgcagaa ccagctggaa tcttcattac ttcgttataa 3120 
ccaaaggttc cttgctcatt aaatgttcct ttgacaacct tacaggaaga atcatcccca 3180 
ccgcaaacac cacatttgtc tcttcggaga gttgaatgaa gttgatgatc acagcctgaa 3240 
aatccactta ttttcaattt tcttttgaaa tcagatcaat gttacctgct ggcatacaag 3300 
ctccagctac acaaatatcg tctccatttc tatcacatgg tgttccatca acaactttat 3360 



ctcgaagcag atagaacgct gcagatccac tgagccgaca atacagcttg caacgttcat 3420
ttggtgcaac attcgcatat tttggaaccc agtgagtatt cgttgaagcg acaccttgga 3480
ttccaatatc tttattgttg aattcagaac attgaacttc acggtatggt tgagtatccc 3540
atgggcattc ttgtgtatta catgaccgat aacgttctcg ttgaccaaca cagtactttc 3600
caccatttcg aggtctaaag taatatggga aaatgtcatt ttaatattga taggaaagct 3660
tagccagtgt ggcctaaaag ctggaagttt ttttaaagat gcgttttcta tcaatttaag 3720
ataaccggct acttcaggtg attctataaa ttttataaag cttggaagct aggtaaatct 3780
gaaaagcctt aaactatctc gaagcggccc gaaagcccag aaaagcagag acggacaaac 3840
atttaagagt gatcagaagc actccatacc ttgatgttac atttgatttt agtgtttcca 3900
cctcgttttc acttctgaac tcgccgattg aaaatatttt gattgaatat attatttgct 3960
ttcagactat ttgatatcat ttcgtttggc agtttaactc actttgggct gtcacaatct 4020
cttaatcctt tttgaacacc accaccacaa gtacgactgc attctcccca tgatcgccag 4080
tcaccccatt gtccgtcaat tttggtaagg gattcggggg ctagacgaac acaggctcca 4140
tgatgacaga actaaatatc caagttttta tgagtttctt ttgtgattaa tttctgagat 4200
actcaccatg cttcttgatt cgtcacaagg agttccgtcg gcccatggca tatgctgagt 4260
tcgacagccc atctggcttc cgtagaatgt tgcacaccaa agacggcggc atgtcggctg 4320
taaaatatca atgtttcatc ttaaagaata tatttaggca aactaaccat ataagggcac 4380
aactcagaag ctggtccaaa tacaaacttg cactgttgat gagcatcgta tttctttcct 4440
ggttcatcac gtacaaagac atcctcgtag taacgacgtt cgaccggctg atcgaataga 4500
cattgagttt gacctcgatt atttctgaca aattgacaat taaatagaat caaaatttta 4560
atagctatct tactcgagga atcgttcgag cattccagct gaacatggcg accaactcca 4620



tggatgagtg ttatattcca acgttggtgc cattatgtgg aagttgttct gaaactgcgt 4680
tttatcaaat ttagtgcttt ggaacttgca aaccttatta accggcatgt aggtagagca 4740
ttttcgttcg tcatcatgag gaatcgaaaa cacatgaccc aattcatgag caattgtgaa 4800
tgcagcactc aatccattgt cttctatgat tgcacaactt ttttgcatat cacacattgt 4860
tccaagttca gcaagtccaa gtgtatcgca ttttccttgt gatcgacaaa tatctttacg 4920
cgtcaaaagg attgcaacgt catgatgttg gacactcgaa tcatctggat cattgtaata 4980
ctgctgccat ctacagaaat cttgaagtgt ttgttgagcg ttctgagtga ttcgtggtcc 5040
agcgttttcc gttttcaaaa cgatcaactt gacaacaacg acattgatag atgcacgaag 5100
ggattggtga cgatagatgg aggcaactgt ggagaagaga gtgagaacgt agtcttcaag 5160
agatcttccg tgatattcgt acatttttgt atccgccacc acaaggactt caacatagtg 5220
atcccaagag ttggcagctc ttcgggatct tgctttgcgt tctataatta aatccttttg 5280
tttcataaaa ttatttaaac atttttttac tgtatccttc ctattaatct tgcaccccag 5340
agctccactt tgacctatct ttgttgtcat tgactctatc aaaaactgtt caactatgaa 5400
aatggggatg caagactaat aaaaggtatt ggtaactggt tccagtagag ctttttttac 5460
tatctgtttc attgattcaa ttttcagatg tttatataac catcttaacc gttcaaatct 5520
cataacatag aacagcctgg cagcccgtga aaaggtgctg aaatcccagt aatttcaatg 5580
gcattcgacc acacacaagt gatccattat cctttgctct tttacttcgt taactaccat 5640
tagctatagg ggacccacga gcaaaattct atagtttctg tgtgtgttag ggtgttttaa 5700
tgggctatta cacaacaccc gatgggatca gcagaatctg agatcttttg ggaaccggaa 5760
aaaaatattg tgataacttc tcttttttct acatttttta cagaactagc aggtaaactt 5820
tcagattgaa atctcgaaaa atgcatccgc ctactcaaaa agtcgttttt aaaatgattg 5880
tttctttgtg tttgtcctct ttttcccgga cgtacgcaac acaaaaccgc ttgcgcgagg 5940 
atgtacacaa aacgtacgtt ctgcgcaatc ttttccctgc agctctctct ctctcacttt 6000 
ttctactcca taaatcagtt ctctgtctgt ctcccaccac ctaaatcatc atcagcatca 6060 
tcacagtccc cccaccaagt tcttgtgtct tctctgacct ttacacgtcg actagggaaa 6120 
agctctcaag cagacactcg agcgccagtt gaaaaaaata gtgtgtccaa atgagcagtt 6180 
tcgaatttga accgtttgtt cttgttctga cataaaccca aaaaaacgaa ctaggcggca 6240 
aaagagatct ggataatcta aagaatctag acaaatttca gaagttctta ccaataacat 6300 
cttcccactg atcttgccac gtggcaaccg tcgtcuccgt ctcgttgaca ctggccgagt 6360
taagatggtc aaacgatttg aagtgcattg gatcgaactt tcggacgaga tgttgcctat 6420 
ggcgacttgc tccgtcgtgc tctagatgtt taaagtgtca gagaaagtga ttacaaagtt 6480 
tctacctgtt ccgtttccac taataattgg ctcaaccgta tggattccgc tgggtagtgc 6540
aagcattccg tactgaaaaa ggcttttatt caccaaaatt cgaactuata caaaccaatc 6600 
cgtcttccga gtcgcataaa ttgacgatgc tgtgctgatg tacaccttta acgtgtgcac 6660 
ggtagataca atcgggatct gttcgagaca ttccacctct aacctcctcc tccgagtcca 6720 
aatataagac catcggcgcg aaatttgaat tggaaaagtg gggaacactt ctggaattga 6780 
aaattaatac gactgttata ataaaattga aatctcatac ttgttatgtg agtccggtat 6840 
ttgattccat ctgtgcaaat gaacgatgta gacggcatca tctgatcgta atcgtaagtg 6900 
acaagcatgt ccacagtctc tggcaactcc ttggagtcga cgtcgccgat ctgttgacgt 6960 
gacatcacgt tttccacgac gtccataaga atctcttcgg acgatgtgat ggctgtcgat 7020 
gacgtgtata ccggcgtctt gacgccatcg actgtgatgc actggcacac ctgaaactta 7080 






gaacattatt tcacttcaaa actttttgga ttgttacctg agtacttggc cctggagaac 7140 
agcacatctg atgagaattc tgagatcgtg ccactccctg ctgaaaggaa catttggtta 7200 
aaaacaaaag ctgataaatt aaaataatta gataaaaacg aacattgcaa cgcataaacg 7260 
aggcagacga cgaggagtat gagagcggcg acgacgggct gcagcagatg gaatgagccg 7320
ccgatggagc gcataccaac agctccgtga tgatgattat gattgngtgg agagcagcaa 7380 
agaaaaaaga gatggaaaga agcagaagct ccgataaagt tcgtccgtct cttctgaaac 7440
cttccaaaaa ctacctgctc gaggtgaagg gaagtcgtct gattgaactg ctactgcttc 7500 
tgatcttttg ataatctccc gagtttgtgt tttcgtttag tcgaattaaa attgtagatt 7560 
gtggaatgag cacttgcaat agggaacaga gcatcacaga ctgaaaaatt aaaaattatc 7620 
tagaatgcaa gcaattttta aatttgtttt aaaatcactt attctgacgc catcttcttt 7680 
tccgatttgc gcagaataaa taaaaacttg actgtaatat tgggaaaatt tcgaaaaaaa 7740 
acaccgttaa gtctgagccc acctttcgcc tttttttgtt gacgaaaaaa accaaacaag 7800 
ctttaaattc ataaaattcc caattttaaa aacatctaaa gtcaattcct cccaataatg 7860
catttgtata tgaacaaaag tctgttgacc ataagtcgtt atattactac aagcaattgg 7920 
tcatcaacaa acctcataaa aatcagtttt gaacgggagc aatttatata aactctgtgt 7980 
gctcttttgc tctttttctt atttcttagt tgtcttctag ttccgccacc actttcgctg 8040 
ctcttgacga aatctgtaaa ttgttcgtca tttttgattt ataagatttg tttggctctc 8100 
ggraggagct ctcaagctgc taatagtcct atagtaaagt actaaaaaca caaagaagca 8160 
gatgaaggtg tcataaaaca ctgataagaa tcatcatgat taggttggtg cagagaaaag 8220 
aagaagaaga aaaaggagat ttagagaaga gaaacaagaa taaaaatgca aaaataaaaa 8280
aaatagtaat aacaatgaac gcagagtctt ccatgttgga gaaggaacag gacccatgtt 8340
gatgtgtatc tgaggggatc caatgtgtag tgatggtagt aaacacttga gagggaactt 8400
Gcacccccga ctagatgatt ggaagcaatt gatgatagat gtagagccaa agaattggga 8460
cctactaatg atctagtcaa gattcttctg ataagagaaa aagacaagga agaacatgaa 8520 
aatgactggt gattgaaaaa taaaacggtt tatgaagtcg gggtgtacta aagatgcaag 8580 
gtctcttgtg acgtattttt tcttccaggc acgttcgcgt tattcacgat tttatgcaaa 8640 
caaggtaagg agtgttttga attttgaata taaaaattta aaagaaatta aagttagaca 8700 
tttgaaaaat tagacaccct catgggaaaa attatagggc gaggagaggc ggtgagaggc 8760
gccctaattt ctgctcggtc gggtagaatg tctaatctaa atcctacctc atgtttggct 8820 
ccttcttaaa tcaaaagctt aaggtcatct ctgaaacgtg cagttgacaa gttcaatggt 8880 
aagaacaggg agcaagcatt tacaacaaaa aagtaaacaa aaattgcatt tgtcgcagtt 8940
caaaatggaa caactcactc ccactcgaga acgttttgaa ggggagagga agaagaggaa 9000 
aatcatcaca caggcacatg gaacttctgg gacacaaaac aatacaaact gggtgccgtg 9060 
aatctcagta cacacacaca aaaatcaaaa aagacggaaa ttaggagcag atgtggtaaa 9120 
gggtggttca atgctgatgg gagagagagg gagaaacttc aaaaaaagaa gtttagattt 9180 
atgttggcta tttcaatcct aaatttatct aaacaattct aaaaatgctg gttttggaag 9240 
gttatctggt aatggtgaag ttttataaac aaaacaagac aaacaattct tgagatctta 9300 
aaaatcttag cgactacaac aatatttagg tattttttaa tggaaaaaag tattgattgt 9360 
tgacttggga aattgaacag caattttttg tacttttaaa tcagttatat tttaactttt 9420 
tagagcacat ttcgtagaca aaagggaaaa cgattggtcc aacatgtgaa gatgatgatg 9480 
tcaacaagtt ttggatcgga gccaaaaaag aaacaaaaca ttcataccat gatgggaaac 9540
aagaggtgca gcaacaactt ttatcaatat tttgtttatg ttttgattat ttttctggca 9600
cccagccagt aattcttttc cgtagagttg acctagaaaa tgttggaggc ggagtcttag 9660
gatcaagaga cgcagactat caaagtaaaa tgagtaaaag gaagtgatat aaactcagga 9720
aacggaggaa aaaaggacga tgataagaga ttgaagactt ggaagagtgt gctctttgcg 9780
ggagagcata ttcttttgag aaaaatggga cctaggggca actgacgcaa ttgaaacatg 9840
gtcgagcggt cggcgggaag acaaaaagtg aagaaggatg ggcaagaaga agcaagagaa 9900
atggcacGca ccgtggaaca tgatcatgat gattgagagt gaaaattgga aatctcgaac 9960
ttttttgcaa cggcgcgttt tggaaaacta acaaagttga ccaaaaaatt attttacatg 10020
tataccggga tgtctaagaa ttgtaaaatt gagtgatcct ttctgtgaca taatttaaag 10080
caatttattt tggttatttc taagcgcctt tttatactag catgttatat tgttaatttt 10140
attatctaaa ctgccgttct tcctatattt attattgcac cccctttgtt cattctgaca 10200
gactatacct cgattaatca taaaaatgtc acaaaagaat aaaaacaact aaaattaaga 10260
aaatacaaga aatttatcaa ttgccaaaaa ttcggccaat cggaaaaatg cttggttgcc 10320
aatttgtcaa aaatttagtc aattggaatt tgtcgatttt ccgaaatgat atgaaagttt 10380
gaatgatgca gctaattttg cagtttaagt ttacattttc aagtttactg taatttttcc 10440
aaaatatgaa gaagagtttt acgaaattaa aagataataa aaaagcaatg caaacatagc 10500
tatgaaatct gatcccgact aagtttgatg gacataggat taataatatt agtctaactt 10560
tctatagaac actaaataaa tacattcact ctcgaaactc tcccttttct gccatcaact 10620
accgtactca cttttgactc aatgacccgc aactgtcaag atgagttagt ttcaagattc 10680
tctgaaacag caataatcta acaagagaaa ctgaaaaaat agagtaaaac taataataat 10740



accacataaa ttgacatgca tgatagatga ttttccggtt ttcaacaaga aaaacaacaa 10800
tttccgagaa atcctcatag tttttggtaa gaaaaaataa attgatagtg atacggtatg 10860
actattactt ctaaagactt acctgattag aaacgtgtag taattgaaga agaaaagtrg 10920
aatttgagaa gttgaatcga gtttacgatg tctgaaaaaa acatagatat tatggtaaga 10980
tcaagcatag aaaaaatgga aaaatacaag aaaatagaga ctagagattg cataggtttt 11040
gcggtggcga aaccgcacac atttttgtct gtgttatctc taattttacg ctctcggtgt 11100
tctctattta ctgtccagaa gaatgaagaa tatgggggaa aagtgcgcgg gaaaattgag 11160
agaccgagtg atgagagccg cagttttgca aaactttttc gggcaataat ccgccggcga 11220
gtactacgag aagcacacac acatacgaaa actgttgagt taaaacctaa aaaattgttt 11280
cgacatattt aattttcgaa ctaaagttta gagggtctgt gcgtgcattt ttgaattttc 11340
caaacaactt tcagttttgc ggaagaaaat tacagcgatt ttttcgaata tttctgaaaa 11400
caacactatt gcgtatcaaa aatttttcga tttgccaaaa ttcagactaa gttttggtgg 11460
ttttggtttg caaacattta aaagaactca aaaaacattt ttagatgttc gaaaccgtac 11520
aattgtagga tacaaatagc tacagaacaa ttagaatata aaatagagtt gtcaaacatg 11580
tttaactaat acaaaaacac agaaactttg aaactcgaaa tttttatatc aaaattgaaa 11640
aagcttgtaa aatttaaata tggatacagt acaaacaata taatcataga tcaaatagtt 11700
catttattta tatatcttgg caaatcaaat cgtatccctt acccactcat attcgatgag 11760
tctacaatta aatcagttgt tttttcatcc tcccggacta ttagtttaac ttccacttga 11820
acaagggcaa agagtacatt aggaagagtt tatgatgaca ggaaaaaagc tatgtaaaat 11880
gacctctttg gattgaaaaa gcgaacgaat tgaggtttag gacccccgga aaatgaagaa 11940
ttcgtggcct cgagaatagc aaattggcgg aattaattat ccgtaagagt gtgaattgga 12000
aacaaccggg acgaatggat tactgaatca aaaatgaaag aaagaagaga tgaaaatacg 12060
tgtgaatcgg atgaaatgtg atgattttag aataacctaa atgcaacaaa acgacgtaaa 12120
gacgcggaag aacaggaatg atcaaggggt acatcttata ggggaaaaat gcactttttg 12180
tgctccaaat gtgagagata atcaggtagg aagagacgta gaataggaac aggaaacggt 12240
aacgatagtg cgcaggtgct tgatttctgt gcttttgcat gtgttccgat ggaatttttg 12300
gaacttttca aggggtttcg gaaagggttc gagatttcgc atgtgagctt tggaagaatt 12360
ttggaagaac tttcaggata acatcgctca agcttgtttg ttagatttca gacttcaaag 12420
tatataccga ttattgaaac attttaatcg tttcttacta ttagtaaagt ttaatcacag 12480
tttgaaaaaa aaatcacaat tttttcaatt atttagacca aactaattat ggtacagaaa 12540
ataacttgca acccgggtat ttcattctaa tttttttcat ttggaaccac tagtttttga 12600
aatagaaact cgttaggatt cttcacatat tatcataact atcagtattt tgttgcacat 12660
cagatctaag ttcagtctaa ttagaatcgc aaatttgacc atcacacttt aaaacaaatt 12720
tacttaggca cagggcatcc ttctaacttt tttgtccccg acaaaatgat gacaaaaatg 12780
acgtgaggaa tcaaggagaa aaaggaaaag aacaggaagc gaaaagtagg agaagctctt 12840
gatttctgtg ctcattcctt gttcggatga gctcactgtt tgcaacattg gcgttggtgc 12900
gcgggaatcg ccattgccga actttttcaa gagacagaga gagagagaaa gagaaggaaa 12960
acgttccgat ttttaaaatg gaaaaaaatg aaagaggaag atgatgaaaa aatgaactct 13020
gcgtgacatt tgttaatatg gaaaaagcat gattacttca aaattgtaca ctaatcccca 13080
cagcacacat tttgaagact tttttacaaa aacaatggtt taagcaagct ttaaaaaatt 13140
gatagtatcc ttaatgctta atcatatcca agtttagttt taagttttga tttcaaaaat 13200
ttctacatca aaaaatcata cttagtgatt atatgcaaaa caatttttaa attcaaggac 13260
atatttttga tttttggaag gatgataact tttttgtgat tccgaaaaag attaaagtag 13320 
gtttaaaacc tctgaccttc tacagaaaaa acattacctc tatgaatttt ttttcatctc 13380 
gttcagaact tgtctcgggt caagccatga agacatgaga tagggtgtaa aacgttccga 13440 
agagaggttt atgactatta ttgtagttga agagaaaaat gatatctcaa tggatttcat 13500 
acagatggtc ggatttcatt cataaaatat cataagaaaa ggtacgttta tgactgtcta 13560 
ggtcaactgg ttttaggttt cttggaattg tttcaaacat ttttaggaaa tattttcttg 13620 
caaatatcta ctaaattgaa gtttgttatt gtttttgaca tattgtagat tttagagaag 13680 
aatcactcag agcaaaaatg ttgggaaaac gtgagaaaaa tccaagagac aaaagaatgg 13740 
tcttactatt agtagatcaa aaaaccagac caattattca tattcctact attcaatata 13800 
tattcaaaaa tgagcaaacc aagaaattgc acctaattta tcatcccaca tatattccga 13860 
cgaaacattc gctctacctt ctttttttct gtctaggaat tataaagggc cataattata 13920 
atttcagtca aggttttgga aaattgttcg actaaccatt atgaaagtta aaaaccaatc 13980 
agtcaaaaca cacaatagga atataaaatt cgtagaagaa aagctttttt tttggtcgaa 14040 
agcaaaatca aattctggaa ctgcgacttt tttagtgcaa ttatccattc aacgcaagtt 14100 
gtctttcaaa atttaaattc cagaagagtt ataacaaaac agacaggtgt acaagtaaaa 14160 
gaaaaataca agttttatcg taaaaactga tacgaatcta gatacacctg ttaaaaaagg 14220 
ctttctcgaa acccagatgc cgtacgaagt aagcagcagc caactaaaca ttttgagtaa 14280 
acatatggca agtgttttgg cgcaaattgt aaagattttc cgtgtgggta actagaattt 14340 
gaaactgtaa gtatgacgac ttaaccacac aaaatcaaat ttcaaaagat cttaaaatgt 14400 
tcgaactttc aaaactttta agctctctcg catctaccgt agtcttctaa taacaacagt 14460
cgtaagagaa agctcaaaat ttttcaaact ttttctgaat gacagaatca gttgtataca 14520
aaaaaaaccc ccaaaatgcg agccccatga acctgacaac cagacaagtc gaaattgtaa 14580 
aatcgtatag atcttggttc acgacatgaa gagcaccgcg ggggcacacg agagcaacta 14640 
ctgcaagcgc tcctgaagag aagaaacatc ttttttccag gaccactggc oagtagtgct 14700 
cccccagatc actttctttt ttcttgcttc atctgatttg tgtctgcgtc gtctgatctc 14760 
tttagaacct atccttcttc ttcttctttt tgatacttcg acatcagaac aacatcgaca 14820 
tgtatcatct tttctctttt ttttttgtta tctattcatt cattcacttt tcatttagtt 14880 
tgattaatag gtgacatgaa ctcttgtcac ttttcaattt caacttctta aatcttaaac 14940
tcacagtgat tccagatatg agcaactcca atgaggtgtt gagtagaaac ctaaatataa 15000 
cattttggat gttttgataa tgttggaaca aataaattga aacaaacaag acttgaaata 15060 
gagacaacgt gcagaataat gtctaccagc tggtttcagt ggcatattgt accacgaacg 15120 
tccgacagaa cgaataacat aaagatcaag aaaaactgtt tgggagcaga caaacaatca 15180 
gaacacagtt ttgttgaggg gaccaaatca taattaatga ctaaatttta acgaagaaag 15240 
tgctcgaaaa gaacagaatt tagaagttga tgaacaatat ttttactttc agattaacaa 15300 
ttatgcttta caaatgacat ccaatctaaa gcatctggta atctgaaatt tgtcaaaaca 15360 
gctttcaaga ctagtttcaa atttgtcgat tcaatggatc aagtgtgtaa ttgatccaat 15420 
aaaaaagagt ataaagtgag aaggaagaaa gtgtgaaaaa agaagaacgt gaaacgtgca 15480 
gaagatacga aatgagtttg aagactgcac ttttcgagcc tcgatggtca gtcacttggt 15540 
cagttgcgaa aaagctgtga aaatgataca ttgtgtcggc tctcgtagag aagaaagcca 15600 
catggtcagg atgactccaa ctgggatatt cagttgtaaa gaacacaatt gatatttttg 15660
catctttttt aactagtttt tacaatatga gaaattgttc tgtgcgaaaa atatgacttc 15720
ttccttgttg ccgaagtgta tttccctgga aattccagta aatacctaat gtaaaaaatc 15780
tcagcagaat gtgttcttac attttgttgt aataataatg tattaaaatt gcattaatta 15840
aaaatttctt caaaatgttc ctacgtcttc tatgcacatt atttaggtca cagtttcatg 15900
gagcacaaaa cacctgccga cgcctctaaa atagttataa ctgcgcatga aatcaggtag 15960
aaaaaactac aaaataacca atacaaattg agtagggcga tggagaggtg ggcggttgga 16020
gaggcgggca acaagcgtcc tcatgacgcc ttgttcattt agaatgtgtt tgctttgaat 16080
tacatacaag tttctaaaat ttaacttaca aaatttaaaa aaagtcacaa caataataaa 16140
agttgtggca atgaaatgtt ttaaaaatct aaatattgag ttttaaataa atgatttttg 16200
aaaattcaca aagaaatgtt acaatctgtg aatgaagacg aacaatgaaa aagtgaggaa 16260
cggacgcgga tattacacat tcagtcacac aataaacgtt cggacactac cacacatttc 16320
tctcatcatt tttttccaaa gtttattcta aagttcaata ttttagtttg attattttgg 16380
acactattct taaaattaat gtataatagt ttagaaaata ttttgaaaca tgaaactttt 16440
ttgttgataa aatagtgcca aacatcctta tgttacgcag ttatccaacc acattnttct 16500
catttttcca ccaaaaaaca ctgaaatggt ccataaaacc tattcaaatg gatatgagaa 16560
tattactttt ttgacatgaa attttcaatg atgtaatgta aaacaaagaa aaatattgcg 16620
ggaaaaattg aacggcgtat tgcaaaaatc ggtgtgcgga ggaggagaag gaaaaggaag 16680
agcaggagaa gcggaccgaa gaattcagaa gcttttaaaa taagaacggc gactttcaga 16740
caaacaatgg actgttgtat aaaaataaag cggaggcggt agagagtcaa agctttcaga 16800
aatgtattag aataggtttc actacctgtt gttgaactca aaaaggtgtg aaaaagtgaa 16860
agtttgtctg aagtttatga cgggaagtgt ccatcaaata actttcaaaa tttgacttat 16920
cagtgagaaa aacacgtcat tttggaacgt taaaatgggt ggcaccgcaa aatgttcaca 16980
atgtgaagtg aattacgtaa taaaatcagt tttattaagc ttattaaact aacccttccg 17040 
gactatttgt ggaatgaaac aattgggggg gttttttttt ccaattttcg attttttttt 17100 
gaatttataa ttaccggaac aaaaatatct ttaaattatt aagatttgag tgatgtttga 17160 
aattttgaac ctgcaaaaca taagcacaaa ataatggagt ttttgtttta aaatatcaat 17220 
aggtgttttt tcacagaact ttaaacaaca aatactcata atttgaatga aaacagtaga 17280 
tcccacaata ttttgaaaac ttatctatat atatatatat atatatataa ttacgaaaaa 17340 
aaaacaaaaa gaaaaaaaca aataatttgt cagttgataa tttttagata tgagttgcca 17400 
aaattgggca atatggtgaa gaaatacggt agttcgtcgc actgtcagac taattttcaa 17460 
gtgttcctag tggaatgaaa ctaacagaag ctatacggta tataatatta ggaacacaat 17520 
taaaacgaac agcggaagaa aagatctagt ggtcacttcc gatttctcag ctgacttttg 17580 
aatgggcacc tatcatcatc tcacttgttt atttgaacag tctcgacttt ttccaattgt 17640
tggcttctag ttcaagaaac gaaaaaaaga gcaataacgg aacagaaaat tcagaaagtg 17700 
gaagagaaat atgagaaaat gatgatgata ataataataa gttagaagag ggttatcgat 17760
gaggaacgga aacgttatct ctgatcgcca tctcattatt attatgagac acaaagatgt 17820 
aagttatggt atctttgaaa gaaaagaaaa caggaaatta tacagaacac acacaatttc 17880 
ggagatttca ttcgaagaac ctaacccaat ttgaactcac tcccacttcc tcttgtctat 17940 
aaaacagtca atcacaggaa caggtgtctg tcttttcaaa atgtatacgt tttccgaata 18000 
atgacacaca atatcacaga caaaatgatc aatgaggttg cagaaaagaa tgcaaaaaaa 18060 
tatagaaaga gagggtgaac aggagataga gaatcaaaat ttgcatagat aaatatgcaa 18120 
tagaaaataa caatttttga acaacaaaga aataatttag tggcatataa tatagcgatg 18180
gaacttgcaa atttttagaa ttatcatata aaaataacaa tgtttctata ttttatgccc 18240 
tataagtctt gcagtatttc ttaaatttaa cagttcattt cttggtaatc tttartttta 18300 
tcaagaagtg ttcaggaaat tttaggacat caaattttta tttattttct aaatctactt 18360 
ttatcaaaat tttagaggtc tagtacacat ctacccaaaa agaagacttt ggagctctca 18420 
aaaaccacct agtgtatggt aaagtacatg agaagtgacg tgtctttggg cagctggcca 18480 
tctttgtcga tatgcgggtg atggtgtttc tgtgagcagt aacaggaaat tctggacacc 18540
tgctagggtg tcaaaccaaa tttatttcaa cccattcttg cttcaaaaaa cccccaacta 18600 
aattattcaa attctcgtaa tttaatgaat cactcagtaa ctgtaacgtt ttttttttca 18660 
gagacaatga tcgaaagtta acaaaaaaaa ctgaggatta aacgttattt ggtatctaca 18720 
gctgacattg gaacatatca aaaagtggta agtgaaagtg aaacgaaaag tgcaacattt 18780 
gaaattgaga gtagaaaaga tcattgaagc agaaatatgg aagtgaattg aaagccgtgg 18840 
cgccaaaacg acggtcaggc gccattgaga aaattaatga gagttcggaa ggttgaaaca 18900
acacaaagac aacgtgaaaa attagtttgg agaagataaa aaatgtctgg agatggacga 18960
tttcttagtt agctgagaat agtttacatt gattttcggg aaaacgcaga atgttagaaa 19020 
aatggaaaca tgtctagact tcagataaat ttgtagaatt tatatttgta gcaaaagcac 19080 
actaacaaag gttacaaagc tattaggaaa aatacggaat gtatttttga aaatttttga 19140 
tttctctaaa ataataacac cattaatttg ctatatttgc tatatatgct atatagtatg 19200 
ttcgcattac tgagcacaaa acttggaaaa agtttaaaaa aaaaggaaac ttgttttctg 19260 
gagaaatcat taaaaacagt acaatttcag acagaaataa atctttcagt gaaagctttt 19320 
ttttgagtaa gactaagtat gcactcacaa cttttctgag tgttccaaaa atgtttaaag 19380 
aaaatactag taaaaatgag catttcgaaa agcaatatat catacaacta cacaaacatt 19440
tcaattaaag gaatcaattt tataatagtt ctaggcaatc ccacttttag attcaatttt 19500 
ctagcacagg gagcattgga agatataaaa acataaagat aaaggtgata aaagatccat 19560 
taaacacatc atatctatca aaccatcact tccatcaaat ccacagattt atcacaaatc 19620 
agtgtgtgac aaatataccg taatattaag ttcaaatggt ggaaaagacg cagacaaagc 19680 
ttttgcataa atactaaata attgaaagaa acgcagagaa tgtaagagaa aaatatacaa 19740 
tatgtgtatt atcaaccatc aacagttttt gattaaaacc atggagaagc gatatacagg 19800 
agcaaattag gagacgcaga ttgagaaaaa atgagaaaat aatgaaagta cggaagggtt 19860 
attgtacaat aagacaggta gcatctctca aagaacctat tgtcaagcag tttaaacatt 19920 
caacaacgtt catttatttt ttagccttca ttatgatatc tcattggttc tataattgga 19980 
ttttttaaat tcagatttct cattcatgta caagtaaagt tgttaattgg ttattatgcc 20040 
caaagtttaa ttatttgagc gcagaaaatt tgaatggaaa tttcagaaaa ctgattcatg 20100 
ctaacttcaa aaaatcctga ataaatacca attcttttcc aagtatgatt ctcgagcctg 20160 
tttacgtgcc tgcctacggt ctattttcta atttttttaa tgataaaatt ttagagtaga 20220 
tcttcaaaaa tcttccttaa aaaatctcca aaaaaatcaa gttcaggaaa actaaagtac 20280 
tccaataaaa tactcttatg caaaaacccc ccattcattt tgcagaaaaa gacaaacaag 20340 
aattaaagat aaaaagttat gatagacagg aagctgattt attagatcaa tgaatcgact 20400 
tttagttttt cttgaactct aatttgaaat agtattcgaa tgagaaaatt gaaaatatac 20460 
aaagatcaaa agttataatt gaaaatcaac aaattgatag tgtttgtata ggattaaatt 20520 
aaaatgtgcg gtacatgaga cagtagtagt agtagccata gtacgtattg gtggctccac 20580 
tcggctactg ataatttcct tttttactga taatttgatg tcatttcgta attttatttg 20640
tgtttccaaa aattgtgggc gtggtttatg aattggtcaa gacatgaatt aaaggaattg 20700 
taaagtaaag aagaaaatga cagaggagaa attattttcg tttgctttgg aaattgcaaa 20760 
ataaattaga ttattaaaga taatagttac ggtttaaaat aaataggtga taaaaaaata 20820 
tccaaaagtt caagtcctaa gaatcttgct attttgcaaa aaaaaagcat gagcttttgg 20880 
cctaaaaatg gcggacagct gtcgggacac tatccaagaa ttcgtgataa acgggtgaag 20940
caccgtctct tatcatcatg ccatttttcg aattttaaac tcagactttg ataaagaaaa 21000 
ttaaaaagag agagtgtgag aaataagagt acacatggaa aatgcaagat ttgaatttgt 21060 
ttccaatttt taaaatgtat ttaaaagagt taccgttcca tttttgatta gctttataag 21120 
tggaaaaatc gtttttggat tattttttga ggaatatttt tgaatgcgct ttcaattttc 21180 
ctataaaaaa ctttgtgttc acttttttat cccgttttta trtttatttt tacaactttc 21240 
aaatttttat gaatgtttta ttgtaaaatc ataaaaaggt gcgaaacatc taaatrgcct 21300 
ggattgcatt taaaagtgca ttagcagaaa rgtattccta tggaatgttt tttgtgcaac 21360 
gagatccaga agctcgaaaa acatccaaat ttcttccaag aaagttgatg ttccaaaaar 21420 
aaaaaagatt ttagcccaat caactaaaaa aaaactctcg tttttttcat atttcacatt 21480 
ttctggtcac tttgaaggaa acactaatcc caaactgaga accgaacatg gattaaacca 21540 
tcccatttac tatttcttgt tgtcttcaaa aagtcttaga attgtgcaaa aaatagaatg 21600 
tttcgaaata ttgcggtttt cgttaaaacc ttttttgagt agattgaggg tccattagaa 21660 
ttcccaagag aacttgatga ccttcatcat caaaattagt ggtcattgaa tgtttgatca 21720 
gacaaaaatg gaaatgactg aatcggaaag agcaagaaaa tcgaaaaaaa aagtatttgg 21780 
aaattctgga aaacttttta aaatttaaga agggcaacga taagaaacag gaaattaggg 21840
attttttagt gatggagaag tacgtgataa ggttaaggtg gaacactagt gcacacgttt 21900
tgaatacact acgtgttttt atttatggta gaatatagca cttaaagaac gtttttaata 21960
caaactgaaa taaaaatacg gaaatgtaat tttttttttt gaaagaatcc gcctgaaact 22020
gaattttcac atcaaacggt agtgattctc tttatgcgtt gggtgatatg tatttacgct 22080
gtcttaaagt tttcgactat aatttaagta atatgtttgt caaaaatcat catggtgctg 22140
tgtcctatgt agccttttct acacttgaaa aatgataatt tttatttgaa aatggtattt 22200
aaattcaagt agaaagttat ttagtcttgt gtgccaagca ataaacacat agtctattag 22260
gcaataaaaa gtcagctact gtttgattta aaaacttaga ctactggtgt gcctgtgcaa 22320
gttactcccg tagtacggat acagagtgaa aactagtgat tgtactttag atcggctgat 22380
agtgaattta cagagaaata attataaaac ttaaaatttt tagcagctca gtcttcaggc 22440
tgcacagcca tattgttaca cttggagtta caaattctgc aaaccatcta ggattgaatg 22500
caaaaactct gaaagtcaca tcaagaaatt ccaacaaaaa acacattaga tgccaactca 22560
ttgaattgca ttgattccca agagaaatag tagtaaaagt gacccctatc cattcctccg 22620
ttacatacaa atatacacac aaaaaagagt gtagacctct tccttctaac ccaaccaaca 22680
cacaacaata tcgttccctt ttatctctaa ttctctgcgt ctccataagc tttgagagct 22740
cttcggagca tcttgtgctt gctccttgta cggcggtaca gtttcctccc tctgctccct 22800
tatgtgtgtt taggtgttgt ttgaacaaat aagtttttgg ccatccacct ccttctcaaa 22860
acctttttct tatgcttctt cttgttttgt gcacattttg gctcttgctt gtctgctcga 22920
gccatagaca aggcggcgac atttttgaaa aaattatatt agtactgtta tatagtactt 22980
aatacaacga tcacaacaac aacacaacga aatgaaaaca tgagatcaaa agacaaattg 23040
ttaggaggag ttggagtttc tacaatcatg aaatgtttat ctagttatta taaaactgaa 23100
attgctcata aaattgtgat accatgaaga ccgaaaaact ctatgcaact gcatactgca 23160
catacttaca acctttattc tgacrtgaat ttcagttttt ggtgttrgca gttattctat 23220
tttgtttaaa agaaaattca attaggaaat aagcaataaa ttttggcatg tatttcgata 23280
gaaggcacgt gtaaatgcca cccggaaatt agaaaaaata agatttctca aactgaaaat 23340
gattgtgaat tgaaaattta agagaatcat tgcaaaagta cacaaatgaa tcatttttca 23400
gattgaacag gaaagtgcag aaatatcaga ttaccgtccc aacagaaacc ggaaataaca 23460
cttttcaggt aaagaattat acagaaatcg raataaattt aaaacaaaag agagttatga 23520
cacattgcag aacggtctct gtggaaaata ggaggaggtg ctgcaaaaac tccttagaca 23580
tggtcatact tacaaaaaaa acagagttta actaaaaatt aaattaagtg agaaaatgaa 23640
gaaaatggag gtctttcgcg gattcatttt acttcttctt ttttccactt ttcgttgcaa 23700
gctttggttt aaaagtttcg caaacaaata aacaatgaac attgtgttga gaagacaagc 23760
caagtgaaag gaaaccattg agagcaaaaa caacaatcaa ttgaaataaa gagtaaagtt 23820
tattgaatat actgatatgt gaatactgga aaaataatta gtctctataa ttggtaccgc 23880
ctggaagatt catttctgat tcccttgtgt ctttgaccaa aactttattt ttttcagttc 23940
aaaattacaa aaaataaata ctcatcttca tcgattcagt ggtgttttaa actcctacgt 24000
ttttctttta caataaaggt aatgtaaacg ttccgagcgt gtagttttct ctgaaaattt 24060
tttaaaaata acaactttat ggtatttttc ttaaagtctt aaactgaaac cgaaacattt 24120
ttgataggaa aactatttta acattttggg aactcggcaa aagctctgca ggcttgccga 24180
acaactctca tttgaaagta ataaatatga aaataaatta tcgaagtttt tttttttgat 24240
attttatgaa tacggctctt ggtagttttt gacgagaaaa ttacatgttg cataaatttc 24300
aagagttata actcatggag accctaattt ctggtttcac tagaaaatca aaaaatcaag 24360
cgtttgagca gaagactgta ggaagagcac acgtcataaa aattagggga tcaacgatcc 24420
gaaacgggga attgaaatac gatatgcgat gagttttggt tcgaaccggc tttgtcccaa 24480
aaaacaacag aacgatggtc tcaggctcac ttgactcatc tcggtgggaa caatttttat 24540
ttgtttttat tccgtacgca cagaaacttt ttttgaggta tttttgatcg tgggtgggtg 24600
gaatggtagc acccaatttc aaatagtgtt tgatttgaag agacaatgaa agaaacaagt 24660
gggagataat ggaaatgacg tgatgaaatg gaacggagga aaactggtat aaatatcgtt 24720
gactatcaaa actacaataa tactaatgga gaaaagttca ggattcttga agattttaca 24780
ttatgatagt tgggatttac tggtttcaag ttcaaatgtc aaacatctgg aagaaaaacg 24840
tataagatta catcaaaata aaactaaaat ttgaaggata aagtaaaaca gcataatata 24900
gtgttttaca tctcatgtag gaaacgaaca aaatctttga acacctagat aacttcaaac 24960
ggaagttggg tgaagaaaag aataggggcc agaatagaag gtcattttga caaagtgaac 25020
agacaaagac attcctaact cggaggtatt ccaaaaactg ttccaatatt gaagaatgac 25080
actatttgat tttatatcat aacattatta atcacatggc ttttttctta ggaaatttat 25140
atcgcaaaat aaaaagtggc cttgatgagt cattcattca aaacatgcct aaaaaccttc 25200
ataattaatt ataaaaatgc tgatacttga ggacccgttt ttttatattt ataaacagtt 25260
gttttcttta ttccgttctc actttgagtt tttttctgaa aatactaaaa aaattaacaa 25320
agttcggcgt tttttgtcga taattccatc tgattatttt cggttttttt acctaattat 25380
caaatatttt agccagagtg aaatttatta tcttattaat atgtttttca atttgttttg 25440
gtattattct gttgaaggaa catgttgcat tttaaatctg ttgttaatac agcggccaca 25500
tgtttagaac tttataacct cgtttaaaca taaattgtat gccatattta ttgcaagtac 25560
tacatgagtt tgaaacagta tcagatacta tattttaaac aaaaatacac attttccccg 25620 
ctatgagaga ttctgataca ttggtttcca atttttttaa aaacttgaaa ttcctcaagt 25680 
ctcccactga attacagatt tctgttctag atacctccaa agacacctag attcgacttc 25740 
ggcatcttcc tcatttttat cttcagtttc atcttttgtc taattttccg tacatttctt 25800 
tgcatcctta ccatctctcc ctctctcact cactcttctt gttcactaaa tctcaattca 25860 
aaatgttttc tgccacgtca tcatcatcat caatgccacc ttctcagagc ccattcgaaa 25920 
aattaccacg gcatcaaaat attcgatatc acgaaaaatg cttctcaatt ccacttcata 25980 
cacttaacta ttttctatgc gttattattt tttatttctt tgttttcact atattttatc 26040 
acgaacgtta tggtggaaaa cctgaaaatg ttcaagttac atcagcaatt tatgartcaa 26100 
attcaaacga actgtcatta atctttctat ttgattcttc aattcgtcga cgggaaatat 26160 
tccttggatt tggtccaaat gactcaaaaa catcaagaaa tgaaactcaa attgagctta 26220 
aaccaccacc cggatttgtt gataactcac aaatttcagt aagtttagga ttttttttca 26280 
aaaaaacttg atatgaagtg ttgaaaaatt gataattggg ccgggcttac atcagagtat 26340 
ctagttatct tgtatttcaa atattaatat tcaaacattg tagagattcg aaatgcgaca 26400 
gtacttcagt aattaccacc cacattttga ctgtcaaaaa agttcccaaa aattgtcgaa 26460 
aacttttatt aggatgtttt ctcattttgg cacgattgga gtgttttttt aacaaatccc 26520 
ttttatgcat caaattaata tctaattttt aaatcaataa tttggattaa ttcaacttgt 26580 
tttataagat tttctcgcta ttaaattagc aaaaaaaaac tatcttcaaa caattagcgt 26640 
gctttaaaac tactaggcct ttgttggcaa cgtcttttca catt-ctggca caaaactata 26700 
aactatgctc agaatttggt aatgtttgaa aatgttttgg gcaagcatat agttattcca 26760
attctaaagt aagattagtc atctattttc cattccattt ttccattttt cacctatttt 26820
ttccattatt taacaaccaa gactgagcaa acattttcct gttttaattt tcatatatga 26880 
aaagacataa gcaaaagctg gatcaaagct tgggcaaatc ctiattcaaag tattttccaa 26940 
cgtttccatt ccctcgtttg taaagtacaa ttggtaatct taaggcttaa ttaatrattg 27000 
tgggagattc ataatgtgaa aactaaatgt taagatttgg tcatcaattg aaaaggaaaa 27060 
accccagtct ttaactgtga atgcagaaca tccaaagtca ttgcttttac gagatcacac 27120 
aggacatcca tatttagaag taagttcaaa tcagaaatcc ccaatccatt ttttcttgta 27180 
gttaccactt caagaaccat actccgattt tcgcgacatt gttagttgtt tcagtccaat 27240 
ttatggagat tttgagatgg ttttaacagg tttaacaaat taatttggtt tcttttttaa 27300 
aacatttaat ttttatagct ttaacatcat ccatatcaat gggatcattt gttagtatac 27360 
catatgaaga gcttactgga gagctttaca agtttctacg tgtatttgaa aaaacgggac 27420 
atgtcaggtt aactgcattt ccaatgatac gtcatcagcc tcgcttcgat tcggaaaatg 27480 
aaaattatca tttgaaaatg atcaaactta aaacagattt aacgcatttg cattgttggc 27540 
taatgcataa aaaccgggcc aaattcatga tcttccaaaa ctctgctgaa attgttttac 27600
cgatttcctc gacgctggaa aatcccaatt acgcctctga atttacacga atatttgaaa 27660 
caccacgagt tgaaggatat gatattttag aatataatgt caaaatttca acggataaac 27720 
gcttaggcga cttttcggat ttctccatca ggcagacaat tgaagcagca aaagcagaag 27780 
aattaaccgg aaattctaaa acattaatca tgagaatggt atcacttttt ttcaaaataa 27840 
tttactgttt ctattttggc atttatttca gcattctcca actccacaga atctcttaaa 27900 
acgcggtaaa atgtatccat ttttcaaaaa tttcccatct ccaccacaag ttattccaaa 27960
gaaaacattg gacaaattgg atacaataac agaaataatt gaagaatctg atgcattctg 28020
gacacttatc aaagaatgtt cagaaaattc gaaatcttgg aaatgctcgt caagaaaatg 28080
tgtaagacca tcagttagac atcgatctct tcatggatgg tattcatatg atattcattt 28140
ttctaaattt ttgaatgttg aaagtttttt ttgttcagat tttcaataaa cttttaagaa 28200
aagaataatt ttaaattcta taattcctga atttccaact atgtttatca tttcccaaag 28260
tacattcgaa aaagctcaat aagcaaaacg accacgaaat aacagtatta aaaaaaaaga 28320
tgttgtcatt tgaagttctg gagtgcgatg aaaagtctct cacctcggac tttctgtaat 28380
ttatttagca tacaacatga atttgaccaa ctcgaaataa ggttaagact gaaaattttt 28440
cacaaaaatt ggaacacttg cgaagcgaat tcaagacttt tcgaagttat taaacaagct 28500
ttcaaattct cagtaaaact gaacgttttt tttatgctct ccaaatcatt ttaatatggc 28560
tgctcgcgtc gctgaagtat tttctagagt atgtttaata aaactaatat gtaaatgaaa 28620
aaccaaaaac tcagataaag agcataactt ttataacgca ttttcagaac tcttcaagct 28680
ttttcagatc acttctatca gcagtattct tcttttttcc aaagacacca agaactgaaa 28740
aggttgaagg agcatcaccg gaaatagagg atgactgctt attgttcttc tttttctgaa 28800
taaaatcaaa ttaaacaccg aaaatatgaa acatattcac taacctgaac agctttcagg 28860
tttgatttat tctgattttc cgccgctgat ctgctctgac ttttgaaacc gggacttgga 28920
gagttaccat tgcgtatgcg agttcgaact ggacgccgat tcttctttct gaataaacga 28980
attatacaaa tttgtattrg aaaacggaca acatacactc cttcttccgc cgaattgctc 29040
atcgattttc tcatttcttg tgttttttcc tggcgttcag gttcaaaagg tggagcaact 29100
ggtttggaca tatacggaag aatgttcgag acttgaatct tttttggttg ctcaatattc 29160
tccattggaa tatgatcggg aagttcaaag tagctgttgg atcctggagc ttgatcaaat 29220
ccttcgagag ttaaaagttc acgaactgct tcactcattg tgaccctttc ctcttcggca 29280
ccagcacaga ttctatactg aaattgcttg ttgtgttgtt ttactcaaaa gaatagtgaa 29340 
caaaattttc tcaccgtaat gaatctgaca atggctggtg ggacgttagc ttcaaatggc 29400 
attcggtatc cgttctgaac acgtggtaaa acctcagcaa ctttcatucc cggataaggt 29460 
tcgattccat catggtacac ttcccaacac atgactccat aagcgaaaac atcagtcttt 29520
ggagtataga acccagttct tggaacttct ggagccaacc atctaatagg aactctgaaa 29580
aatttgaaaa ggttggaatt tttgacgttc tctaactttt tgtgaggatt catccgatag 29640 
ctatagcctt ctcgtgacag tccaaagtcg gatatcttta cttgtccatt cccgtagaga 29700 
caatttctgg acgcaatatc gcgatgaatt atttgaagtg aatgaagata ttcaagacca 29760 
agaccagctt gaagaaccat cgtatgtttc ttggaaattg gcaatgaacc aatgttcttc 29820 
tttagatatg aatccaaagc tccattgtca gcctaaaata atttacataa gacatttttt 29880 
cttagtaaaa taaaattaat cagttaatta attaacatac caactccatt atgaccatca 29940 
aaggttcctg tcctgcagcc acaccataaa aagtgacgac attcggatgt ttgaactttc 30000 
tcatcaatct ggcttcgtgc atgatttctt tgatctgctc ttttgtcaaa gattccaact 30060 
ttgccagctt gattgcagct tttttgacgg tatttcctat gcgaatttct cccaattgaa 30120 
cctctccaaa tgctccttct cctaatttct tgattaatgt cacgtcagaa tgttgctttt 30180 
cccacggttc acgaccaatt ggacggatga ttacagtttc tgggccctaa aagcaaacaa 30240 
atgaaaataa gtttactcac ttaatttgta agatcacccc agcaacaggt tctttagaac 30300 
gatgatagta attgagaaga tctgcgatac tagaaaacca ttttttatca actgcaaact 30360 
tgttattgtg ctctcgaatt acataatgac gaatctgaaa taatattctt aaaaattatg 30420 
agcaatcgtt ttacgtacgt cctcaattac tccaacatag acagagagaa caaatttcct 30480
tggctctccc acttttggat cagtaaatcg aactagaaaa tcgcctcgtt gagtgagcaa 30540 
ctgtttcata tcctcacgtg gcaataagcc atggtaccag ggttcttttg caagtacttg 30600 
c 30601 

<210> 34 
<211> 8009 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 34 

ggatccttgg ccacgccatg ggcgatgaaa ttgaccgcgt cgtaacgggt catgtcctgc 60 
tcctgcagga agaaggccgc gttcgattcc cgttccgcaa agatcgcgac aaggacattc 120 
gcccccgtca cctcggtccg gcccgagctt tgcacatgga tcgcggcgcg ctggatcacc 180 
cgctggaagg cggcggtcgg cacggcttcc gagccttcga cttcggtgat cagcgtcgag 240
agatcatcgt cgatgaactc ggtcagggtg gtgcgcaact cgccaagatc gacgccgcag 300 
gcgcgcatca cgcggctggc gtcgggctcg tcgarcagcg cgacgagaag atgttcgagc 360 
gtcgccagtt catgtttgcg cgtgttggcc agcgccagtg cggcgtgaat tgcttgctcg 420 
agcgtggtcg aaaacgaagg catgcggcgc tcctttcctc gggtctcccg atactggcct 480 
catgtgatta agtttcggtg gatttcgccg cgcttcaagg cccggacgcg tgttttttcc 540 
acctctcgcc gctctgttgc aaaactgacc agcgcggcgg gctttcgcgc gatccgcagg 600 
cagcgcgcga aagtgctttc agaaccggtc cttgcgcgcc cgcgccgcgg tgaagacggc 660 
gagcggcgcg gcatccccgg gcaggccgag cgcggcgcgc aacgcagcgt catcggcgcg 720
caaaaaggga ttcgttgccc gttcctcgcc caaagtcacc ggcaaactgg gttccccggc 780
cagccgcaag gccgtcaccc ggtccatccg gtcgtgcagc cgaccgttcc ccggttccag 840
gctgagcgcg aaccggccgt tcgcggcggt gtattcatgc cccgaacaga cccgggtttc 900
gggcggcagc gcggccagac gggtcagcgt gtcgaacatc tgcgcggggg tcccctcgaa 960
gagacgcccg cagccccagc tcatcaggct gtcgccggaa aagagcagcc ccgccccggg 1020
cagataccag gcgatatggc cgagcgtatg gccgtcggcc gcgatcacct gcgcggcctc 1080
catccccaga tgcagcacgt cgcccggggc caccggatga tcgagcggcg gcagccggtg 1140
ggcatcggcc gcggcccccg ccaccttggc cccggtcgcc tgcgccagcg cctcgacccc 1200
cgcgatgtga tcggcgtggt gatgggtgat caggatgtgg tgcagctgcc agcgccggtc 1260
ggtcagcacc ttcagcaccg gggccgcctc ggggacatcg accaccacca cggtatcggt 1320
ggcggtgtcg tgccagagcc aggcgtaatt gtcggtcagg caggggatcg gggtcagttc 1380
gagggtcatg gccttttgcg catctttcgc tatcctgacc cagcttcgcc caaggaaggc 1440
caacctgcaa tgcatctcga cgtgctcgac ctgcgtgatt tctactaccg cacccaattg 1500
gggcgcacgg cgcaaaaggc gatccgcgac aaggtggtcg aactctggcc ggacacccag 1560
tccggcatgg ccgggctgac ggtggcgggc tacggcttcg cggtgccgct gttgcgcccc 1620
tatctgggcc gggcgcggcg ggtgatcggg ctgatgcccg cgcagcaggg cgtgatgccc 1680
tggcccgccg gagagcccaa tgtctcggtg ctctgtgccg aaaccagctg gccgctggag 1740
accgggatga tcgaccggct ggtggtgctg cacgggcttg aagtctccga cgaccccgat 1800
gcgctgatgg aggaatgctg gcgcacgctg ggccccggcg ggcgggcgct gttcarcgtg 1860
ccgaaccggg tcgggctttg ggcgccgcgc gaaaccacgc ccttcggctt tggccgcccc 1920
tatacgatgg gccagctcga ggcgcaggca cgacgggtgg ggtttgcccc cgaacgtcag 1980
gcggcggcgc tgtacattcc gccctcgcag cggcggttct ggctgcgctc ctccgagatg 2040 
tgggaacggc tgggcacaag ggcggcgggc tatctggcgg cgggggtggt gatgcttgag 2100 
gtgatcaagc aggtgcattc ggtgcgccgc tcggggcttg gcgcggcggt gcgcaagccg 2160 
ctctcgatcc ttgaaggggc gcccaagccg gtggtcgggc ggatgtgagc cgcccgcggc 2220 
cgcaagaatc gcccggccgg aaaagcccgt ttccgcggca cttcgccctg cggcggggaa 2280 
acgcagcggg gcgggcttcg accctttgcg ctaacactcc gtgccggtgc agaaaatgtg 2340 
ccagcctgat gcggattcct gccgccaaga tggttgcgag ggtcttgatg ctctgctaga 2400 
cgcaaccccg aatgcggcgt gcgagatcat tttgggcgcc gaggggggcc tctgaatcgg 2460
tgacggaacg attggttccg gtgtccgcgt gcggaggcaa aagcatcgga agggtggacg 2520 
tgtccgaacc agcttcgatt tccgcagcca ttgccgggcg ttatgccacg gccatcttcg 2580 
acctcgcgca ggaggccaag ggcatcgacg cgctctcggc cgacgtggac gcgctgacgg 2640 
ccgccttggc cggttcggcc gagctgcgtg acctgatttc ctcgccggtc tacacccgcg 2700 
aggagcaggg ggacgcgatc gccgcggtgg ctgcgaagat gggcctgtcg gcgccgcttg 2760
ccaacggtct gaaactgatg gcgacgaagc gccgtctgtt cgcgctgccg cagctgctca 2820 
agggcctggc cgccgcgatc gccgaagcca agggcgagat gaccgcggat gtcacctcgg 2880 
ccaccgcgct gagcgcggcg caggccgaga agctggcggc gacgctggcg aaacagacgg 2940 
gcaagaccgt caaactgaac gtcgccgtcg atgaaagcct catcggtggc atgatcgtca 3000 
agctgggttc gcgcatgatc gacaccacgg tcaaagccaa actcgcttcc cttcagaacg 3060 
ccatgaaaga ggtcggataa atgggcatcc aagcagcrga gatttctgcg atcctcaagg 3120 
agcagatcaa gaacttcggg caggatgccc aggtcgccga agtgggccgc gtgctctcgg 3180
tcggtgacgg gatcgcgcgc gtgcacgggc tcgacaacgt ccaggcgggc gagatggtcg 3240
aattccccgg cggcatccgc gggatggcgc tgaaccttga agtcgacaac gtcgggatcg 3300
tgatcttcgg gtcggaccgc gacatcaagg aaggcgacac cgtcaagcgc accaacgcca 3360
tcgtggacgt tccggcgggc gaaggcctgc tgggccgcgt cgtggacggc cttggcaacc 3420
cgatcgacgg caagggcccg atcgtggcga aagagcgtcg catcgccgac gtcaaagccc 3480
cgggcatcat tccgcggaaa tcggtgcatg agccgatggc gaccggcctc aagtcggtcg 3540
acgcgatgat cccgatcggc cgcggccagc gcgagctgat catcggcgac cgtcagaccg 3600
gcaagaccgc gatcgcgctc gacaccattc tgaaccagaa gtcgtacaac gacgccaacc 3660
cgggcaacaa gctgcactgc ttctatgtcg ccatcgggca gaagcgctcg accgtggcgc 3720
agctggtgaa gaagctcgaa gaagccggcg cgatggaata caccaccgtc gtcgccgcga 3780
ccgcttcgga cccggcgccg atgcagttcc ttgcccccta ttcggcgacc gcgatggcgg 3840
aatacttccg cgacaacggc atgcacgcgc tgatcatcta tgatgacctc tcgaagcaag 3900
ccgtggccta tcgtcagatg tcgctgctgc tgcgccgtcc gccggggcgt gaagcctatc 3960
cgggcgacgt gttctatctg cactcgcgcc tgctggaacg ttcggcgaaa ctgaacgagg 4020
atttcggttc gggctcgctg accgcgctgc cggtcatcga aacccagggc ggcgacgtgt 4080
cggccttcat cccgaccaac gtgatctcga tcaccgacgg tcagatcttc ctggaaaccg 4140
aactgttcta ccagggcatc cgcccggccg tgaacaccgg tctctcggtg tcgcgcgtcg 4200
gttcgtcggc ccagaccaac tcgatgaagt cggttgccgg tccggtgaaa ctggagcttg 4260
cgcagtatcg cgaaatggcc gcctttgcgc agttcggttc cgaccttgac gccgcgacgc 4320
aaaagctgct gaaccgcggt gcccgtctga ccgagctgat gaaacagccg caatartcgc 4380
cgctgaccaa cgccgaaatc gtggcggtga tctttgcggg caccaacggc ttcctcgatg 4440
ccgttccggt gaaggaagtc ggccggttcg agaaaggcct gctggcctat ctgcgctcga 4500
cccgcaagga cgtgcttgag tggctcacca aggaagaccc caagatcaag ggcgacgccg 4560
agaagaagct caaagacgcg atcgccgagt tcgccaagac cttcgcttga cggcctgaaa 4620
ggacagggag atgcccagcc ttaaggacct caagaaccgg atcgtgagtg tcaagaacac 4680
tcgcaagatc acgaaagcga tgcagatggt cgcggcggcg aacattcgcc gcgcccagga 4740
aagcgccgaa gctgcccggc cctatgccga gcggatgaac gccgtgatgt cgagccttgc 4800
cggtgcggtg ggctcgaccg acggtgcgcc gcgcctactt gcgggcacgg gctccgacaa 4860
ggtccatctc ctcgtcatca tgacgggcga gcgcgggctt tgcggcggct tcaacgccaa 4920
tatcgcgaaa ctcgcgaagg cgaaggcgat ggaactgctg gcccagggca agacggtgaa 4980
gatcctcacc gtcggcaaga aaggtcgcga cgcgctgcgt cgtgatctgg gccagtatta 5040
catcgatcac atcgacctga gcgacgtgaa gaaactgagc tacccggtgg cgcagaagat 5100
ttcgcaaaac atcatcgacc gcttcgaggc gggcgaatac gatgtggcga cgatcttctt 5160
ctcggtcttc cagagcgtga tcagccaggt gccgaccgcc aagcaggtga tcccggcgca 5220
gttcgaaacc gatgcggcct cggcctcggc ggtttacgac tacgaaccgg gcgatcagga 5280
aatcctgacc gcgctgctgc cgcgtgcggt ggccacggcg atctttgccg cgctgctgga 5340
aaacaacgcg tccttcaacg gggcgcagat gtcggccatg gacaacgcca cccgcaacgc 5400
gggtgacatg atcgatcgct tgaccatcga gtataaccgc tcgcgtcagg ccgccatcac 5460
caaagagctc atcgaaatca tctcgggcgc cgaggcgctc tgacggaacc ggagatagaa 5520
gagaatggca agcaaaggca aagtgaccca ggtcarcggc gccgtcgtcg acgtgcagtt 5580
cgaagacggc ctcccggcga ttctgaacgc ccttgaaacc accaacaacg gcaagcgcct 5640
cgttctcgaa gtggcgcagc acctgggcga gaacaccgtc cgcaccatcg cgatggacgc 5700
gaccgagggt ctcgtgcgcg gcgcggccgt gtccgacacc ggcggcccga tcaccgttcc 5760
ggtgggcaac gccaccctgg gccgcatcct gaacgtcatc ggcgagccgg tggacgaacg 5820 
cggtgacgtg tcgaaagccg aagcccgggc gatccaccag cccgcgcccg atttcgcggc 5880 
gcagtcgacg gaaagccaga tcctcgtcac cggcatcaag gtgatcgacc tgctcgcccc 5940 
ctattccaag ggcggcaaga tcggtctctt cggcggcgcc ggtgtgggca agaccgttct 6000 
gatcatggaa ctgatcaaca acatcgcgaa agtgcactcg ggcttctcgg tgttcgcggg 6060 
cgttggcgaa cggacccgtg agggcaacga cctttaccac gagatgatcg aatcgggcgt 6120 
tatcaacctc gagaagctcg aagaatcgaa agtggcgctg gtctacggcc agatgaacga 6180 
acccccgggg gcccgtgccc gcgtggcgct gaccggcctg accctggcgg aacagttccg 6240 
cgaccagtcg ggcaccgacg tgctgttctt cgtcgacaac atcttccgct tcacccaggc 6300 
cggttcggaa gtgtcggcgc tccttggccg tatcccctcg gccgtgggct accagccgac 6360 
gctggccacc gacatgggcg cgctgcaaga acgcatcacc tcgaccaaag ccggttcgat 6420 
cacctcggtt caggccatct acgttccggc cgacgacctt accgacccgg ccccggccac 6480
gtcctttgcc cacctcgacg ccacgaccgt tctgtcgcgt gcgatctcgg aactcgggat 6540
ctacccggcc gtcgacccgc tcgactccac ctcgcggatc cttgacccgc aagtcgtcgg 6600 
cgaagagcac tatcaggtcg cccgtgacgt ccaagggatg ctgcaacgct acaagtcgct 6660 
gcaggacatc atcgccatcc tcggcatgga cgaactgtcg gaagaagaca agctgacggt 6720 
ggcccgcgcc cggaagatcc agcgcttcct gtcgcagccc ttcgacgtgg cgaaagtctt 6780 
caccggctcg gacggcgtgc aggttccgct cgaagacacc atcaagtcgt tcaaggcggt 6840
ggttgcgggc gaatacgacc acctgccgga agcggccttc tacatggtcg gcggcatcga 6900
tgacgtgatc gcgaaagccc agcgcctcgc cgctgcggcg taagggggaa ccatggccga 6960
taccatgcag ttcgatctcg tgtcgccgga acggcggctt gcctccgttg ccgcgagcga 7020
ggtccgtctt cccggcgtgg aaggcgatct gacggcgatg ccgggccatg cgcccgtcat 7080
cctctcgctg cgtcccggca tcctgaccgt ggtcagcgcc gcgggcacgg ccgaatacgc 7140
cgtgaccggc ggcttcgccg aggtttcggg cgagaaggtg accgttctgg ccgagcgcgg 7200
tctgacccgg gcggaactga ccgccgcggt tcatgccgag atgctggccg aggccaagaa 7260
agtcgcggac gccgcgcatc cgtcggtggc cgatgccgcc gcgaagatgc tggccgacat 7320
ggaagcgctt ggctcgcaca tcaatctctg acgggacatc ccgccggata tctcgggccc 7380
cggtcatcgc gccggggccc ttgctttttg cttttgtctt gccgcgccgc atattagcgt 7440
gaaggtgcag gcagccggag tgagcgacag gaacggatga agaagttttc ctcgacccgg 7500
atcggcgtgg cccagggatc gctggtgctg ttttcggatt atctggacgg cggcgtgatg 7560
tggacgggcg agggcccgcg cgaattgcgc aggctggtgg tgttcgacga agccttccgc 7620
gagatcccgg cggtgcaggt gtcgctgtcg atgtgggaca tcgaccagaa gcacaatccg 7680
cgcatggaca tttccgccga catggtgacg gccgagggct tcgtgatcgt ctttcgcacc 7740
tggggcgaca cccgcgtcgc ccgcgtccgc gcggactggc tggcgatcgg cggctgcgcc 7800
aatgacgacg actgggacgt ggcctgatcc cgcccggctt gactttccgc ccccccgcgc 7860
cgatggtgcg cgcgactttc ccatccaacg aggcccgccc gtgcaacaag atgccccccg 7920
ctggcagctc gtggtgatcc tgtgggggac gaaatatccg gtcgccgaac tcaacgccct 7980
gatcgagacc gtgtggcccg ggcctcgag 8009

<210> 35
<211> 9810 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 35 

gatatcgggc ttgtcatttt cgattgcgac ggggttctgg ttgattcgga agttctggcc 60 
gtggccgtcc tcatcgcaga actggaccgg gcgggcgtgc gggtcgacga ggccttcgtg 120 
catcggcatt ttctgggccg gagcttcccg gctgtrcagg aggtcgtgca gcgccagttc 180 
ggcgtgaccc tgcccgagac cttccaggtc gaggaacgtg cccggctgct gtcagccttc 240 
gagaccggcc tgcgggccat gctcggggcc gcggagaccg tccgcgcgct gtcggtgccc 300 
tactgcctcg ccacgtcgag cacgccggcc cggctcacgc gctcgctgga gatcacgggc 360 
cttgcggccc tcttcgaggg acgctgcttc accgcgagcc aggtggcgcg cggcaagccc 420 
gcgcccgatc tgttcctgct cgccgcggcc gagatgggcg tcgcgcccga acgctgcctc 480
gtgatcgagg ataccgagcc cggcgtgcgc gcaggcctcg cggccgggat gcaggtctgg 540 
cgcttcaccg gcggtagcca tttcgcgaac cgatcccccg aggatgcgcc cgatgccctg 600 
ccgcaccggc ggttcgacag cttcgaccgt ttctacgaga ccctgcccgg cctgcgccgg 660 
gccaagtgcg agaccctgac atgatcgacc ggcccgagag cgagccgacg cccctcgacg 720 
atgccgcgcg cgcgggctgg ctctattatg tcgcaggcct gactcaggat cagatcgcgc 780 
gggagctcgg cacctcgcgt cagcgggcgc agcggctggt gagccgggcc atctccgaac 840 
ggctgatcca cgtccggctc gagcaccggg tctcgggctg cctgcarctg gaagccgcgc 900 
tcctccggcg cttcgggttg aagctggccc gcgtggcgcc gagtctcggg tccgaggtgg 960 



atcccctgcc ctccatcgcc cccaccgccg 
agcggccgat ggtggtggcc ttcggcaccg 
tgacctcgat ggtctgcgaa cagcacaaga 
atggctcggc ctcctactac gatgtgatct 
actatccgat gccgatgccg gtcatcgcgc 
cgctaaagcc cgtgcagtcg gtgctgcggc 
ggctgggaca gatgggcgag gacgcgccgc 
agctgaccga gatgcaggat ctgggcgccg 
cggagggtcg ctacctcgaa accagcatca 
tttccgagga tcggacggtg gtcgccatcg 
acgcaggctt aaggggccgt cttttcaacg 
cacttctgtc ctgaagccgc cgaaaggcgc 
tcggtgagta attattcgcc gcacgaaata 
gatgaccgca agatttcgcg ccctgatggg 
cgccggcgcc gaaaccatca ccgtggcgac 
ggggctcatg tccgagttca acgcgcagca 
cgaggaaaac gtgctgcgcc agaaggtcac 
cgacgtgctg accatcggca cctacgaggt 
gagcctgaac gacctgccgc cggagtatga 
cggcctgacc gtcgacggcg agctctatgc 
catgtatcgc aaggacctga tggagaaggc 



-182- 

ccgccgaggt ggagcgggtg ctgcgctcgg 1020 
gccggtcgct gcgcgccacc gtcgaggaga 1080 
tcgtgtcgct caacggaaat atttctgcgg 1140 
tccgcatcgc cgaccgtgtg cgtgcgccgc 1200 
aggatgcggc ggagcgggag ctgtttcatg 12 60 
ttgcgcgcaa tgccgatgtg accttcgtcg 1320 
tcctgaagga cgggttcatc acgcccgagg 1380 
tcggagaggt ggcgggatgg gtcttcgact 1440 
atcagcgggt tgcgggcgtc cgtgtcgaac 1500 
ccggtggcag gcgcaagctc gcggcgctgc 1560 
gcctgatcac cgacgagttc acggcgcagg 1620 
ggcaaaaagt atttgacagg ctggcacccc 1680 
atgctcaccg tgcaggccag ggaggatact 17 40 
cgcgtgcgcc gtggctgcgc tctcgtccgc 1800 
tgtcaacaac ggcgacatga tccgcatgca 18 60 
ccccgacatc accgtcgagt gggtgacgct 1920 
gaccgacatc gccaccaagg gcgggcagtt 198 0 
tccgatctgg ggcaagcagg gctggctcgt 2040 
tgccgacgac atcctgcccg cgatccgcaa 2100 
cgcgcccttc tacggcgaga gctcgatgat 2160 
ggggctgacc atgcccgacg cccccacctg 2220 
ggacttcgtg aaggaagcgg cgcagaagat gaccgacaag gatgccgagg tctacggcat 2280
ctgcctgcgc ggcaaggccg gctggggcga gaacatggcc ttcctcagcg ccatggccaa 2340 
cagctacggc gcgcgctggt tcgacgagaa ctggcagccg cagttcgacg gcgaggcctg 2400 
gaaggccacg ctgaccgact atctcgacat gatgacgaac tacggcccgc ccggcgcctc 2460 
gaaaaacggc ttcaacgaga acctcgcgct gttccagcag ggcaagtgcg gcatgtggat 2520 
cgacgcgacg gtggccgcct ccttcgtgac caaccccgag gaatccacgg tggccgacaa 2580 
ggtgggcttc gcgctcgccc ccgataccgg caagggcaag cgggccaact ggctcggggc 2640 
ctggaacctc gcgatcccgg cgggctcgca gaaggtcgat gccgccaagc agttcatcgc 2700 
ctgggcgacc tcgaaggact atgccgagct ggtggcctcg aaggaaggct gggccaacgt 2760 
gcctccgggg acgcggacgt cgctctacga gaacccggaa tatcagaagg tgccgttcgc 2820 
gaagatgacg ctcgacagca tcaacgcggc tgacccgacc cacccggcgg tcgatccggt 2880 
gccttacgtc ggtgtgcagt tcgtggcaat ccccgagttc cagggcatcg gcaccgccgt 2940 
gggccagcag ttctcggcag ccctcgcggg ctcgatgtcg gccgagcagg cgcttcaggc 3000 
ggcccagcag ttcacgacgc gcgaaatgac ccgcgcgggc tacatcaagt gagcccttcc 3060 
gcgggccggc cctgagcggc cggcccgcac cgcttgccgc ttccggccgt atccgccgga 3120 
ggcctttccg ccccatcagc cccgaggcct ccatggcgac ccagcattca aagactgcgg 3180 
cgcgtctgat gatttccccg gccgtgatcc tcctgttcct gtggatgatc gtgccgctgt 3240 
cgatgacgct ctacttcagc ttcctgcgct acaacctcct catgccgggg atggagagct 3300 
tcaccggctg ggacaattac tattacttcc tgaccgatcc ggccttctcc gcggccctga 3360 
ccaacacgat cctcctcgtg gtcggcgtcc ttctcatcac cgtggtgggc ggggtcctgc 3420 
tcgcgctcct gctcgaccag cccttctggg ggcagggcat cgtgcgcgtg ctggtgatcg 3480 
ctcccttctt cgtcatgccc accgtctcgg cgctggtctg gaagaacatg ttcatgaacc 3540 
ccgtgaacgg gatgttcgcc catatcgccc gcgggctcgg ccttccgccg ttcgacttcc 3600 
tgtcgcaggc gccgctggcc tcgatcatcg gcatcgtggc ctggcagtgg ctgcccttcg 3660 
ccacgctgat ccttctgacg gcgctccagt cgctcgaccg cgagcagatg gaggcggccg 3720 
agatggacgg cgcctcggcg ctcgaccggt tcatccacat caccgtgccg cacctgacgc 3780 
gtgccatcac cgtggtggtg ctgatccaga ccatcttcct tctgggcgtc ttcgccgaga 3840 
tcctcgtcac gacgaacggt ggacccggca ccgcctcgac caacatcacc tacctcgtct 3900 
atgcgcagtc gctcctgaat tacgacgtgg ggggcgggtc ggccggcggc atcgtcgccg 3960 
tggtgctcgc caatatcgtg gcgatcttcc tgatgcgcat gatcggcaag aatctggacg 4020 
cctgacatgt cacgccgcac ctcaacccgc cgcacgctga tcgtcacgct cgccgcctgg 4080 
acgatagcct tcctcatctt cttcccgatc ctctggacgg tgctgatgag cttcaaatcg 4140 
gaaggagacg ccatcaaggc gcccttcgcc atgctcttct cggactggac cctgcaatcc 4200 
tacgccgatg tgcaggaacg gtcgaactac gcccgccact tcatgaattc ggtggtgatc 4260 
tcgctgggct cgaccctcgt ggcgctcgcc atcgcgatcc ccgccgcctg ggccatggcc 4320 
ttcgtgccgg gccggcggac gaaggacgtg ctgatgtgga tgctgtcgac caagatgatg 4380 
ccggcggtgg gcgtgctcat cccgctctat ctgatcttcc gcgacacggg ccttctcgac 4440 
acgcggatcg gcctcgtgat cgtgctcacg ctcatcaacc tgccgatcgt ggtctggatg 4500 
ctctacacct acttcaagga gatcccgggc gagatcctcg aggcggcgcg gatggacggg 4560 
gcgacgctcg gctccgagat cctctatatc ctcacgccga tggccgtgcc gggcatcgcc 4620 
tcgacgctgc ttctgaacgt gatcctcgcc tggaacgagg ccttctggac gctgcagctg 4680 
accacctcgc gggcggcccc gctcacgcag ttcatcgcga gctattccag ccccgagggc 4740 
ctcttctacg ccaaactgtc ggcggcctcg accatggcca tcgcgccgat cctgatcctt 4800 
ggctggttca gccagaaaca actcgtccgc ggcctgacct tcggcgcggt gaagtgagga 4860 
ccacatgggc aagataaccc tgcgcaacgt ccagaagcgg ttcggtgagg cggtcgtcat 4920 
cccctcgctc gacctcgaca tcgaggatgg cgagttcgtc gtcttcgtcg gcccctcggg 4980 
ctgcggcaaa tccacgctcc tgcgcctgat cgcgggcctc gaggatgtgt cggacggcca 5040 
gatcatgatc gacgggcgcg acgccaccga gatgccgccc gcgaagcgcg gcctcgccat 5100 
ggtgtttcag agctacgcgc tctatccgca catgacggtg aagaagaaca tcgccttccc 5160 
gctgcggatg gcgaagatgg agccacagga gatcgagcgg cgcgtgtcga acgcggccaa 5220 
gatcctgaac ctcaccaact atctcgaccg ccgccccggc cagctctcgg gcgggcaacg 5280 
gcagcgggtg gccatcgggc gcgccatcgt gcgcgagccg gcggccttcc tgttcgacga 5340 
gccgctctcg aacctcgatg cggcgctgcg ggtcaacatg cggctcgaga tcaccgagct 5400 
gcaccagtcg ctcgagacca cgatgatcta tgtcacccac gatcaggtcg aggccatgac 5460 
catggccgac aagatcgtgg tgctgaacgc gggccggatc gagcaggtgg gctcgcccct 5520 
caccctctac cgcaatccgg cgaacctctt cgtggcgggc ttcatcggca gcccgaagat 5580 
gaacctgatc gaggggcccg aggccgccaa gcacggcgcc accaccatcg ggatccgccc 5640 
cgaacatatc gacctgtcgc gcgaggcggg ggcgtgggag ggcgaggtcg gcgtctcgga 5700 
acatctcggc tcggacacgt tcctgcatgt gcatgtcgcg gggatgccca ccctcaccgt 5760 
gcggacgggc ggagagttcg gcgtccatca cggcgaccgg gtctggctca cgccgcaggc 5820 
cgacaagatc caccgcttcg gcgccgacgg aaaggcgctc tgacatgcgg ctcgacggca 5880 
agaccgccct catcaccggc tcggcgcgcg gcataggccg cgccttcgcc gaggcctatg 5940 
tgcgtgaagg cgcgcgcgtg gccatcgccg acatcaacct cgaggcagcc cgcgccaccg 6000 
cggccgagat cggccccgcg gcctgcgcca tcgccctcga cgtgaccgat caggccagca 6060 
tcgaccgctg cgtggccgag cttctcgacc gctggggcag catcgacatc ctcgtgaaca 6120 
atgcggccct cttcgatctg gcgcccatcg tcgagatcac ccgcgagagc tacgaccggc 6180 
tgttcgcgat caacgtctcg ggcacgctct tcatgatgca ggcggtggca cgggcgatga 6240 
tcgcgggcgg ccggggcggc aagatcatca acatggcaag ccaggccggc cgccgcggcg 6300 
aggcgctggt gggcgtctat tgcgcgacca aggccgccgt catctcgctc acccagagcg 6360 
cggggctgaa cctcatccgc cacgggatca acgtcaatgc catcgccccg ggcgtggtgg 6420 
acggcgagca ctgggacggg gtggatgcga agttcgccga ctacgagaac ctgccccgcg 6480 
gcgagaagaa gcgtcaggtc ggcgcggcgg tgcccttcgg ccgcatgggc cgcgccgagg 6540 
acctgaccgg catggcgatc ttcctcgcca cgcccgaggc cgactacatc gtggcccaga 6600 
cctacaacgt ggacggcggc aactggatga gctgaggccc aaggcccggc cctccccccg 6660 
tcgaacgcgc cccctatccg aggtaatccc atgacccgct ccgtcacccg tccctcctat 6720 
gaccgcaagg cgctcactcc cggcatcgtc catatcggcg tcggcaactt ccaccgggcg 6780 
catcaggcgg tctatctcga cgatctcttc gcgctgggcg agggccacga ctgggccatc 6840 
ctcggcgcgg gcgtccgccc gaccgatgcg cggatgcgcg aggctctggc cgcgcaggac 6900 
aatctctcga cggtgatcga gctcgatccg gcgggccacc gggcccggca ggtgggggcg 6960 
atggtgggct tcctgccggt cgaggccgac aatgcggccc tgatcgaggc catgtcggat 7020 
ccgcgcatcc gcatcgtctc gctgaccgtg accgagggcg gctattatgt cgatgcctcg 7080 
ggcgccttcg atccgacgca tcccgatatc gtggccgatg cggcccatcc tgcgcggccc 7140 
gcgaccgcct tcggcgcgat cctcgccgcc ctccgcgccc gccgcgacgc gggggttaca 7200 
cccttcaccg tgatgtcctg cgacaacctc cccggcaacg gccatgtcac ccgcaacgcc 7260 
gtggtgggcc tggccgagct ctacgacgcc gagcttgcgg gctgggtgaa ggcgcaggtg 7320 
gccttcccga acggcatggt cgaccgcatc acccccgcca ccggcccgca cgagcgcgaa 7380 
ctggcgcagg gcttcggcct cgccgatccg gtgcccgtca cctgcgagcc gttccggcag 7440 
tgggtgatcg aggatcattt ccccgccgga cgccccgcgc tcgagaaggt gggcgtgacc 7500 
ttcaccccgc atgtccatgc ctacgaggcg atgaagatcc gcatcctgaa cgggggccat 7560 
gcggtgatcg cctatccgtc ggcgctcatg gacatccagc tcgtgcacgc ggccatggcc 7620 
catccgctga tcgcggcctt cctgcacaag gtcgaggtcg aggagatcct gccccatgtc 7680 
ccgcccgtgc ccgacaccag catccccgac tatcrtaccc tgatcgagag ccgcttctcg 7740 
aaccccgaga tcgccgacac gacgcgcagg ctctgcctcg acggttcgaa ccggcagccg 7800 
aagttcatcg tgccgtcgct gcgcgacaat ctggcggcgg gcacggtgcc gaaggggctg 7860 
gtgctgctct cggcgctctg gtgccgctac tgcttcggca cgacggactc gggcgttgtg 7920 
gtcgagccga acgatccgaa ctggacggcg ctgcaggacc gggcgcggcg ggcgaaggag 7980 
acgccggccg agtggctggc gatgaccgaa gtctacggcg atctggcgca gaacgatctt 8040 
ctggcggccg agttcgcggc agccctcgag gcggtctggc gcgacggggc cgaggcggtg 8100 
ctgcggcgct tcctcgcggc ctgatccgca gggcccagcc gctcggagca ccgaagcgga 8160 
gcccctgccc cttgcggcgc accgtgaggc gaaacgaccg ggccaccccg gggccaccgc 8220 
ctcggtaaca ccatggtatc gcgcaagaat gccggcgcct ctgccgaacg ggcccggctg 8280 
ccgggcgagg cgccggactt gtcaaggcgg cggccctcgg gtagagaggg cgggcgtggc 8340 
cccgttagca cagtggtagt gcagcgctct tgtaaagcga aggtcgttcg ttcaaatcgg 8400 
acacggggca cgcgatcctc cctccgcatc ggcgctcgcc cccggtctgg actgcctctt 8460 
cggaaggcac ctgcccgctt gtgcgccgcg ccctttcctc gcttcccaag cgtctgtcac 8520 
ggcttgcgga aagccgtgcg cctcggttct ggacagccgc cccttgcggt gtaatctgcc 8580 
ctcagcgcgc agccggcgga cagaagccgg cccgccacgt ccacaaggga ggaatgccat 8640 
ggatcgtcgt tcattcatca ccaaggccgc cgtgggaggg gccgccgcga gcgccctcgc 8700 
cgcgccggcg cttgcccagt ccgcgcccaa ggtcacctgg aggctcgcct cctccttccc 8760 
gaaatcgctc gacacgatct tcggcggcgc cgaagtgctg tcgaagatgc tctccgaggc 8820 
caccgacggc aacttccaga tccaggtctt ctcggcgggc gagctggtgc cgggcctgca 8880 
ggccgccgac gccgtgaccg agggcaccgt cgaatgctgc cacacggtcg gctactatta 8940 
ctggggcaag gatcccacat tcgcgctggc cgcggccgtg cccttctcgc tgtcggcgcg 9000 
cggcatcaac gcctggcact accatggcgg cgggatcgac ctctacaacg atttcctcgc 9060 
gcagcacaac atcgtggcct tcccgggcgg caacaccggc gtgcagatgg gcggctggtt 9120 
ccggcgcgag atcaacaccg tggccgacat gcagggcctg aagatgcggg tcggcggctt 9180 
tgcggggaag gtgatggagc gtctgggcgt cgtgccgcag cagatcgcgg gcggcgacat 9240 
ctatccggcg ctggagaagg ggacgatcga cgcgaccgaa tgggtcggcc cctatgacga 9300 
cgagaagctc ggcttcttca aggtggcgcc ctactactac tatcccggct ggtgggaagg 9360 
cggcccgacc gtccatttca tgttcaacaa gagcgcctac gaggggctga ccccggccta 9420 
tcagtcgctg ctgcgcaccg cctgccacgc ggccgatgcg aacatgctcc agctctacga 9480 
ctggaagaac ccgacggcga tcaagtcgct ggtggcgcag ggaacccagc tcaggccctt 9540 
cagccccgag atcctgcagg cctgtttcga ggccgcgaac gaggtctatg ccgagatgga 9600 
agcctcgaac cccgccttca agaagatctg ggactcgatc aaggccttcc gctccgagca 9660 
ctacacctgg gcgcagatcg ccgaatacaa ctacgacacc ttcatgatgg tgcagcagaa 9720 
cgccggcaag ctctgagccc gagcgccgcg cgaaagagga ccccggagcc gcgttccggg 9780 
gtcttttcat gggcgacagg ggccggcgcg 9810 

<210> 36 
<211> 1886 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 36 

tgagtgtcta ttttttttcg ggttttttta agtgtgaatc acatggttag gagcagttgt 60 
cttcaatgtg accaaccatc ccaaggctct aattcaacgt ttgggtgtgg gggcccgctg 120 
gcagctgtgt gtgccactgg gctgttggtg ttggtgcttt actccccctc atcgcaaacg 180 
gctaattggt cggcacaggg tatttccaca aaggcgctgt atccggcagt gcctgtgcct 240 
tccactctgc tgcctggaag cgcgcctgcc aaacaccagc tgcatgtttg gagggcacat 300 
gcgatgtcgg aggccacaac aaacaattca ttcaaacagt cattatttgg gtacaatgcc 360 
atctcctcca tttggcttca actggctggt gtggccgcca ctttctttgc atttggagct 420 
ttgatggcag ctgtaacgca acgcaaggag atcgccgtct tctccgcctc gggtcaggct 480 
gctgagccgg agggggcgga gcccctgaag cggccttttc cgtctcctgc tgccaaacct 540 
aagccgctct tctccacccc ggcaaattcc ttcagcaaca tcttccaggc gcctccatcg 600 
ctgcgcacgg actccaccta tggccgaggc ccgcgctcga ccagcttcac cgacatcagc 660 
aactggccct ccaacaacgc actccgcaac ccccagtcgg tgattgacat cgggggagga 720 
gtcgacttcc tgggggacag aagccctgga aacccgttca cgcggctgcg ggggtccccg 780 
agctccaccc tcagcaacct cggcatgggc ctaggcctgg ggctgggcaa gggcaagggc 840 
ttcggcaagg gcttcggcaa aggccggggg ttccccgtgg aggaggaggt ggaggaggag 900 
caggaggtgc tgtcgtgggc cgaccgccgg cgggcgctgg cggaccccga cgccccgccg 960 
atgaacgagg acatcaagta cccgcagctg cggctggtgc gggccgtgcc gggcggccgg 1020 
gacgagaagc tcggtgtgat gtcgaggcag gaggcgctgg agctggcgga ggcggaagac 1080 
atcgacctcg tcctcgtcag catcgacacc gaccccccgg tggccaagct agtcaattac 1140 
tcgaagttga agtacgagtc cgagaagaag aagaaggaca gccacaagaa ggggaaggtg 1200 
aaggaggtga aggagctgaa ggtgtcccat aagatcggcc agcacgacta cgacgtccgc 1260 
gtgaagcagg cccgaaagtt cctggagggc ggccaccgca tcaaggtgtc gatggagttc 1320 
aaggggcgcg agaaccagtt cgtggagatc ggccgcgcgg tgatgaagcg cttccagaac 1380 
gacctggcgg acatgggcaa ggcggacgcc gtgcccaaga agctcggcac ccggctgatc 1440 
ctgaacctgg ccccggccgg ggaggcgctg aaggtgattg cggagcggag ggcagagcgc 1500 
gacaggaaag ccgcggctga ggaggagggg gagggcgacg acctcgactt cgtggacgag 1560 
aacgaggacg aggatgtgga gggggagggc gaggaggaag aggccgagga gctggaggag 1620 
gagacagcgg aggggacgga ggtgccaacc cgcagctgat cgccgatccg cgggggacag 1680 
ccacctcccc cccggcctcc ctgccggggg ccggcaccat ccgtcgttgc ggtgcggcgc 1740 
tgccatcaac ggccgtcctt gagcttaatg ctcccgccct ccgttggccc gcggcggtcg 1800 
ccaggttgct ggcctggctg cccgcagctc ctcccctccc cgactgacac agtgtggatg 1860 
accgtgatgt gcgccttttc gccttc 1886 

<210> 37 
<211> 3015 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 37 

ccgctcatct ccaggcctcc ctgagtgcgt acccgagagc ggcaagtaga gaaaggaaca 60 
cagatacagc accatggcct ctaggctcgt ccgtgtgctg gcggccgcca tgctggttgc 120 
agcggccgtg tcggtcgacg cgcgcttcgt ggtgcgcatg gtgcaggtgg tacaccgcca 180 
cggtgcgcgc agcgcactca tcgacgacaa cacgacggag atttgtggca ccctgtaccc 240 
gtgcggtgag ctgaccggcg agggtgtcga gatggtccgt gctatcggcg agtttgcccg 300 
cagccgctac aacaacctct cattggtgga gagccctctc ttcccgtcga cgcggtacaa 360 
ctcctctgtc gtgcacacac gctccaccca cacccagcgc accatccaga gcgcgaccgc 420 
ctttctgcgc ggcctcttcc aggacgacta cttctacccg gtggtgtact cgaccaacag 480 
aacgaccgaa acgctgctca gcactgacgc ggtgccgtcc gtggtgggcc gtagctggct 540 
cgacaacccg gcgctgcacg ccgccctcaa cccggtgatc gatgagcacc tcagctggga 600 
cgccatccag agcgctgcca aggacgcatg ggtcgagggc ctgtgcgcgg actacaacgc 660 
ccgcaccaac tgcgtcctcg acatgtacga cgtggccgcc gccttcgagg ccgccgggcg 720 
tcttgacaat gccaccaatc tcaaggcggt gtatcccggc cttcaggagg tgaacgccgc 780 
ctggttcaag tatgtcttca gctggaacca cacgagcaag ctcgatctca cgcagggctc 840 
cgcctcgcag aaccttgcgc agacggtgct ggccaacatc aacgcccacc gcctctctcc 900 
gtcgtacaac atgttccagt acagcgctca cgacacaacg gtgactccct tggctgtcac 960 
gttcggtgac cagggcgaga cgacgatgcg tccgcccttc gcggttacca tcttcgtgga 1020 
gctgctccag gacaccgcag atgccagtgg ctggtacgtg cgcctcatcc gcggcaaccc 1080 
tgtgaaggca gccgacggca cctatgtctt ccaggagtct ggtatcaagg catactgcat 1140 
cgatgaagcc gggaacaagt acctcgcaca caccggcatc tgcccgctga atagcttccg 1200 
ccgcatggtc gactactcgc gccccgccgt ggctgacggt cactgcgcca tqacacagac 1260 
tcagtacagc aacatggatt gcccgcgcac tatcgcggac aacaagccgg tgccgtcgcg 1320 
ctgctggctc taccgccacg tttgccctag caaggcatgc ccggacagct acattctccc 1380 
cgcggtcgac caccagtgct accccgggcc cgacgttacg aaccccacca gcagcagcag 1440 
cagcgagggt accaccacca gcagcagcga gggtaccgcc accagcagca gcgacgttac 1500 
caccaccagc agcagcgagg gtaccgccac cagcagcagc gacgctacca ccagcagcag 1560 
cgagggtacc gccaccagca gcagcgacgc taccaccagc agcagcagcg acgctaccac 1620 
caccagcagc agcgagggta ccaccagcag cagcagcgac gctaccacca gcagcagcga 1680 
cgctaccacc accagtagca gcgagggtac cgccaccagc agcagcgacg ctaccaccac 1740 
cagcagcgag ggtaccgcca ccagcagcag cgacgttacc accaccagca gcgagggtac 1800 
cgccaccagc agcagcgacg ctaccaccac cagcagcagc gagggtacca ccagcagcag 1860 
cagcgacgct accaccagca gcagcgaggg taccgccacc accagcagcg acgctaccac 1920 
cagcagcagc agcgagggta ccaccagcag cagcagcgac gctaccacca gcagcagcga 1980 
cgttaccacc accagcagca gcagcgaggg taccgccacc agcagcagcg acgctaccac 2040 
cagcagcagc gagggtaccg ccaccaccag cagcgacgct accaccagca gcagcagcga 2100 
gggtaccacc agcagcagca gcgacgctac caccagcagc agcgagggta ccgccaccac 2160 
cagcagcgac gctaccacca gcagcagcag cgagggtacc accagcagca gaagtgacgc 2220 
taccaccagc agcagcgagg gtaccgccac caccagcagc gacgctacca ccagcagcag 2280 
cagcgagggt accaccagca gcagcagcga cgctaccacc agcagcagcg agggtaccgc 2340 
caccaccagc agcgacgcta ccaccagcag cagcagcgag ggtaccacca gcagcagcag 2400 
cgacgctacc accaccagca gcgacgttac caccaccagc agcagcagcg agggtaccgc 2460 
caccagcagc agcgacgcta ccaccaccag cagcgacgtt accaccacca gcagcagcag 2520 
cgagggtacc accaccagca gcagcagcag cagcagcaaa agcacaagtt catcggatgt 2580 
cccttccttc aaaaagcccg cgaactggag cccgcgcgtt ctctcgccgg aaaggggccg 2640 
ccacattgcc ggggacatca tccgccgcgt gacgaacggt gttacgatcg gtgcgggtgt 2700 
ccgaaagcac gatgagtaca gccggcaccg ccaacagtag cacaacggca tgtaactctt 2760 
ttgtgcatgt ttgaatggag aggaggcttc tgtacagcgt acattgtttc gagaaggtat 2820 
cacaaccgct cgtttcaccc ccgtcatctt ttcattttga tctccgtcgt ctcatactgc 2880 
ctttgtgggc tctctctggg tgtgggcgct tgtgcgtgtg tcgctgtaaa gtcgttgacg 2940 
ccatcgctct tacctgtggg ctattttttt aattatggtt tattattact tccctctctg 3000 
cgcgtccctc tgcag 3015 



<210> 38 
<211> 38186 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 38 
gatccttcct gcctcttccg gcgtctgggg ctccaggcgc ccttggcttg cagattgatc 60 
tccctgatct ctgcctccat ctgctcacag ccttctcccc tgtgtgtctc tgtctcttct 120 
tgtaaattca tccgtcgttg gatcagggcc cacccggttc ctcgtggcct cgccttaact 180 
gggccatgtc tgcagagacc ctatttccac ataaggtcct attcacagga accgggggtc 240 
aggatgtcag cctgtctttc tgggagatgt agttcaaccc acaacacaca tcaaacagtt 300 
attgagcgcc gactgcgtgc cctgccgtgt gcttgaaggt cccaccctca ggaagcgggg 360 
cctagggatg gcggccgtga tcacgcaggc agcagagagc agctctggga agcggggagg 420 
gacgaggacg gggaggcgac atcagcaagg ccgtgtgtga gccaggcagg gtgtccccgg 480 
tgtagcacct ggctcgggca gaggccccga ggaggggctg gaggagctgg gcgaggaggc 540 
gggcaggacg ggcctgacac tagggacctc gggccccggg aatgcctctg ggggggcgtg 600 
tacacccgtt gctcccagga ggcacacact gcggttcgct tcgccaagaa tgtttaattg 660 
catttgatga ctacggtttc cattcattca tttgtagaga tataacactc agaccacaaa 720 
atgcataaaa tgcggtggct tttagtatta acagagtgct gcacccgata ccacagcctc 780 
actccagaac attctcatgg gcccaaaagg agacctgggg tgttagtcac cagctcactc 840 
cccgtcccca gcccctggca acccacgcta cttagtcatt atttaggtgt ttaggagttg 900 
caaagtcaaa tctttaaacc cacatatggc caggcgtggt ggctcacgcc tgtaatccca 960 
gcactttcag aggccgagac gggcagatca cctgaggtca ggagttcgag accagcctgg 1020 
ccaacatggt gaagccccgt ctccactaaa aatacaaaat tagccgggcg tggtggtggg 1080 
cgcctgtaat cccagctact ctggaggctg agacaggaga atcgcttgaa cccaggaggc 1140 
ggcggttgca gtgagccgag attgtgccac tgcactccag cctggacaac agagcgagac 1200 
tccgtctcaa aaaaaaaaaa agtaccaaaa agtgccccag gtcataaggg cacagctcga 1260 
tagctggtcc ctaaagggaa cgtggtgtaa ccaccacaca gaacgaagct ggaacgttcc 1320 
tgccgtcctt agaagctgcc tttgctaagg ggaattgccc tgacttccca caccattgat 1380 
tcatctccag acccttggtt ttcatgttga tttttcaaaa atcacctgat agtctgaccg 1440 
aatgtagctt tccactggtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgagagagag 1500 
atggagtctc gctctgtcac ccgggctcca gtgcagttgt gtgatcttgg ttcactgtaa 1560 
cctcctcctc ccgggttcaa gagactcgtg cctcagcctc ccgagtagct gggattacag 1620 
gcacccgcca ccacacccag ctaatttttt gtatttttag tagagatggg gtttcaccat 1680 
gttggccagg ctggtctcga actcctgaca tcaggcgatc cacccacctt ggcctcccag 1740 
agtgctggga ttacaggtgt gagccaccac gcccggcctt atttttcccc cattttcttt 1800 
tttttttttt ttgagtcagg gtcttgttct gcgctcaggc tggagggcag tggtgtgggg 1860 
atcacggctc actgcagcct cgacttcctg caccaccacg cctggctgtt tttttttttt 1920 
ccggtagaga cgggggtctt accgtgttgc ccaggctggt ctagaactcc tgggctcaag 1980 
cgatcctccG gcctcggcct ccgcaaatgc tgagatcaca cgcgtgagcc cccgcacccg 2040 
gcctcctttc caccgctctt gtctacagcc gcccctcctg gtccgattgt attggcagat 2100 
gtcgccaata cggtgtcaaa cggcgaaggg gcactgagcg ttttttcttt ctcccgtcct 2160 
tggcggcagc agctcggttc cggctacggg gctgagcccg tctctcagac gaggaaactg 2220 
gggtccgaga ggtgagccgg tcccagaggc agggcgaggg ggaagcggga gtggggtccg 2280 
cagcggaccc agccctgcct cccccctgca ggagatcgtc aacttcaact gccggaagct 2340 
ggtggcctcc atgccgctgt tcgccaacgc cgaccccaac ttcgtcacgg ccatgctgac 2400 
caagctcaag ttcgaggtct tccagccggg tgactacatc atccgcgaag gcaccatcgg 2460 
gaagaagatg tacttcatcc agcacggcgt ggtcagcgtg ctcactaagg gcaacaagga 2520 
gatgaagctg tccgatggct cctacttcgg gggtgagctt gaggggggcg cgcctggagg 2580 
gggagggggc acgcgacccc cgcggtgtgc agagccaggg ggccggggcc ggggccgggg 2640 
ccggggatgg ggatggggat ggggatgggg ccggggatgg ggatggggat ggggatgggg 2700 
ccggggatgg ggatggggat ggggccgggg atggggatgg ggccggggat ggggccgggg 2760 
atggggccgg ggatggggcc ggggccggca ccagggagag cctgggtggg aagcgcccac 2820 
gctggccaag gtgcagaggc cgggccgtgt gcctgggcgg ggagggccgc ggcgcccgcc 2880 
tcgtccagca acccccccct gcgcgccacg tgcagagatc tgcctgctca cccggggccg 2940 
ccgcacggcg agcgtgcggg ctgacaccta ctgccgcctc tattcgctga gcgtggacaa 3000 
cttcaacgag gtgctggagg agtaccccat gatgcggcgc gccttcgaga cggtggccat 3060 
cgaccgcctg gaccgcatcg gtgagcgggc cgggggcgtg gccggggcgg gtgccctggc 3120 
gggggagggg cgtggccaag gcatcaggag agtggcttgg acagtggcag ggggaagggc 3180 
gtggctgtgg catcaggggc acggttgggg cagagacgtg gccaaggcat caggagtgtg 3240 
gccatggcag caggggcgtg gctggggcag gggcagcggc tggccgctcc taggacccct 3300 
ttgggtctag aggctgattt tctgacctat tgtcctactt cagccagagg cagcctgttt 3360 
cccaagggag ggaatgcaca gggtgtttgc ggttgtgccg aatgctcggt gagcacctgc 3420 
tgtgtgctgg gggtgcaggg gacagacccg ggggcccact cagactccca gggaggctta 3480 
tggactggtg atgaaatcac acacgactgg gctgtgtgcc agcagggcag gtggggccgg 3540 
tgggcttccc tgagttggga atgcagagtg gagaccaggg taagggatgc catgtggaaa 3600 
cggggaggaa gatgtgttcg tggagtggac acagcacatc ccaaggccct gaggtggaaa 3660 
agaggcctag agtccagaga gccagggagg cctggaggag gttggggaag aaggggaggc 3720 
cagacacaca gggcccagtg ggcggcaggg agagtttaga ctaaatcagg agcatcaggg 3780 
agccatggag ggttctaggt gggcggagga cctggtcaga ttgtarccgc caaggcgggc 3840 
cgtgtccagg agggagacgg tgacctggcc tctcaggggg gcagtctctg gggcagggag 3900 
cggcagagcc ctgatgactg gatgtaggcg ccagagagat ggcggctcat gctgctgttc 3960 
gtgggaatgg gaatgaagac catggctgaa acgcaggaca ggtgcgacgg agtggtgtca 4020 
gggagctccc tggtgtacag taggaagctc tccacaactt gctctataca gtgagtatgc 4080 
aacccgttcc tgagtatcag gtgcttaggt tataacttct gtatacagca ggtgctcagc 4140 
acaggctgtg tacaggcagg tgttttcggt atgcctgtgg cacactggag gcagtcatta 4200 
cataatcagc gtatacaggt ggtacacatg catacttggt gcacagtgat acctgctcca 4260 
tgtacacagc aggcattaaa tacctgttta ctgccaggcg cggtggctca cgcctgtagt 4320 
cccagcactt tcggaggcca aggtgggtgg atcacgaggt caggagattg agaccatcct 4380 
ggctaacatg gtgaaacccc gtctctacta aaaaaaaaat acaaaaaatt agccgggtgt 4440 
ggtggcgggc gcctgtagtc ccagctactc gggaggatga ggcaggagaa tggtgtgaac 4500 
ccgggaggtg gaccttgcag tgggccgaga tcgcgccact gcactccagc ccgggcgaca 4560 
gagcaagact ccgtctcaga aacaaagcaa aacaaaagcc ctgctttctg tatgcaggtg 4620 
cttcatgcat gctggctgtg catagcaggt gctcagcctg tatatggcag gtactcaata 4680 
tccatactat aggccagaga tgctacatat gtgcttattg tatacagtag gtggtaaatg 4740 
catgcttgct ctacacggca agcactgtgt gcgcacccgc ggtgcagagt aggtgctcgg 4800 
tgcccgctgt acgcagcagg cgctccctgt gcacacgcta acgccccctc tcccgcaggc 4860 
aagaagaatt ccatcctcct gcacaaggtg cagcatgacc tcaactcggg cgtattcaac 4920 
aaccaggaga acgccatcat ccaggagatc gtcaagtacg accgcgagat ggtgcagcag 4980 
gccgagctgg gtcagcgcgt gggcctcttc ccgccgccgc cgccgccgcc gcaggtcacc 5040 
tcggccatcg ccacgctgca gcaggcggcg gccatgagct tctgcccgca ggtggcgcgg 5100 
ccgctcgtgg ggccgctggc gctcggctcg ccgcgcctcg tgcgccgccc gcccccgggg 5160 
cccgcacctg ccgccgcctc acccgggccc ccgccccccg ccagcccccc gggcgcgccc 5220 
gccagccccc gggcaccgcg gacctcgccc tacggcggcc tgcccgccgc cccccttgct 5280 
gggcccgccc tgcccgcgcg ccgcctgagc cgcgcgtcgc gcccactgtc cgcctcgcag 5340 
ccctcgctgc ctcacggcgc ccccggcccc gcggcctcca cacgcccggc cagcagctcc 5400 
acaccgcgct tggggcccac gcccgctgcc cgggccgccg cgcccagccc ggaccgcagg 5460 
gactcggcct cacccggcgc cgccggcggc ctggaccccc aggactccgc gcgctcgcgc 5520 
ctctcgtcca acttgtgacc ctcgccgacc gccccgcggg cccaggcggg ccgggggcgg 5580 
ggccgtcatc cagaccaaag ccatgccatt gcgctgcccc ggccgccagt ccgcccagaa 5640 
gccatagacg agacgtaggt agccgtagtt ggacggacgg gcagggccgg cggggcagcc 5700 
ccctccgcgc ccccggccgt cccccctcat cgccccgcgc ccacccccat cgcccctgcc 5760 
cccggcggcg gcctcgcgtg cgagggggct cccttcacct cggtgcctca gttcccccag 5820 
ctgtaagaca gggacggggc ggcccagtgg ctgagaggag ccggctgtgg agccccgccc 5880 
gccccccacc ctctaggtgg cccccgtccg aggaggatcg ttttctaagt gcaatacttg 5940 
gcccgccggc ttcccgctgc ccccatcgcg ctcacgcaat aaccggcccg gcccccgtcc 6000 
gcgcgcgtcc cccggtgacc tcggggagca gcaccccgcc tccctccagc actggcaccg 6060 
agaggcaggc ctggctgcgc agggcgcggg ggggaggctg gggtcccgcc gccgtgatga 6120 
atgtactgac gagccgaggc agcagtgccc ccaccgtggc cccccacgcc ccattaaccc 6180 
ccacaccccc attccgcgca ataaacgaca gcattggcgc caagcctggc cgcgtgtgat 6240 
tgcccgagac ccgcagggcg tgcacccttc ctgaagacag tggctcctgg gggrggcaaa 6300 
agagctttat ttacacactg acaaggctca cggggtgtca gctgaagaag taggtggaac 6360 
gcttcacctg ctccaggtcg aaggcccctg cggaggaagc agagcggacg gcgtgggtqg 6420 
cgggaaagcc ccgccctggc ccgcagttcg agccaccctt gcgaggctgc ccacccgcct 6480 
acctggcttg ggcaccgcct gcagtgtctc cttcagctgg ctggcctcca agatcttctg 6540 
gggcctgggg ttggaagcag ggtggggtga ggctgaggcc aggttttggg gtgggggggg 6600 
aatccaggta gttggggtca gggagcgcct tactcagagc agaaccgctt gaccaggaat 6660 
ctggacaggt cctgcaggat gggctcgctg tgcaagcgga caaactgctc ccggcacacc 6720 
tgggcaggag tcagaggatc cccaggggtg atcaggcagg ctctgggcac cacccctacc 6780 
caacgcccca gtgtgggggc cccacccatg ggtggactga ggctcagact acgggggcac 6840 
ctggttcatg acggagacat cagctgcgtg agtccagtaa cagtcgtgca cagagacgaa 6900 
ggtcaggccc ttcctgtggc agagcggagg actcctgaag ggaggggagc tcacagggcc 6960 
acccagtgac cagcatcctg gccctgcgct cagccccctc cacttgaggt ccagggaagc 7020 
Gcaccctcct gcaggcctcg ccccacccct cgccccgccc ctccccaaac atcctgggtt 7080 
aggtatcagt acagggggag gaaatgttcc cagaagcctc ctcgccccac ccctgccgcc 7140 
ccccacgctg ctgtgggagc ctcagctccg agggcggcta cgaggtcccc tcctgccagg 7200 
gccaccaccc cgcatcctga gcattcccag ctcccgtggc cggtagattc tgctggaacg 7260 
acctccacgt gctccagatc taaccacaca tcgcggtgcc aagaaatgcc cagcaggaag 7320 
gggcagcgcc catgctcggc tctccctgtc gggccacagg aggggagctg ccaggaccac 7380 
ctacattcgg ggcacacagc ctcagggcct ctacacaggc cccacagaca cagcagatcc 7440 
actctgccca gtccctgccc ccagctagac ccagccttgc cagctgtgcc ctgctagcca 7500 
gaagacgccc ctgggaggcg agcggcaccc acgccgtccg gagacgccca cctgtagcag 7560 
tgcagggcgg tgagcatcat gtgggaggag tccagcgagt ggatgaagtt gggcgggaag 7620 
ccgttcttct gcttacgtgt gttgggcttt ctgaggacgg aacaggtgcc ggtgggggcg 7680 
gcccagggac acccctaact ggccgctgtc tccaccgtgg ctgctctcca gacccccggc 7740 
caggccccag cccgggcccc ccactcaccg gctgatgtct ccgttgtggg tgtaggtgat 7800 
gctctgaatt ccacctccta tttgctaaaa aggggaaggg gccggtgagt cccacccgag 7860 
gcccagcacg gtggtggtac attgaggtgg tggtacactg gggtagtggc acgctaggat 7920 
ggtggcacac tggcgcggga tggggtggca cactggggcg gtggtacact ggggtggtgg 7980 
tacgctgggg cactggtaca ctgggacgct gttacactgg gatggtggca cactggggag 8040 
ggatggggtg gtacactgac cttgaccttg gagtccaggc gatagggctg gatgacgggg 8100 
acgcccaggg gtgtgaccca ctccaccaca gagcccatgt gggagatgag gcgggcactc 8160 
tcggtcagcc agtgctgtgg gacacaggcc gtctcagggc agggggctca ggccggggat 8220 
CGcgtccact tgcttaggga gtcctggccg agcggggaca ggacaggacg tacctggatg 8280 
gcccgggtcc ccgagaacat ctcctgtaga ctcttgaaga cctggcgtac gagatagtga 8340 
gaggcctccc acacgaactc ctgcagaggg cgggcagcag gtgcaggtcc tcaggggctg 8400 
gcccgttcac gccctactcc cccctatttc agagccactg aggcccaagg cctagggcct 8460 
agcagggggg caggggaatg gggcctggcg cccacgcagt cagcaagaaa cgcccaagcc 8520 
ctaacaggca gccagtggtc tgggggagca gccagggctc ctgctgggag gctgggtcgg 8580 
gggcacaccc gtctgagttt taaatggcag tgaaaccaac gtgttcgcag cgcgacatgc 8640 
ctggcgcacc tggggaaagt cgctcagctc ccggaggcgc ttctcaatct gcaggcgccc 8700 
gccatagcgc gtgaccccgt acaccaccgt catcaccgtc tgcttcacca ccttgcgggt 8760 
gatgaaacct tccagcacct gtgccacccg catgccccgc tgggcgtcct gcctacggaa 8820 
cacctccacc tgcacggcgg gtgggccggg ggcgcgggtc agccccgcta gcagcccagg 8880 
ggccaccaag cacccatgaa gcccccgccc cagccccacc acatcctcag gacaggccaa 8940 
ggtgagggca cctggggccg agactcaggg ctcacattgc ccccacgccg agatgccccc 9000 
gggcagcagg gcacacccta cctgcgcggc cacgccgctg tacacgtcct gcggcacatc 9060 
cgagggctcc aggttgacgg aggcggcgcc cacgctgtcg cggcccagag cagcataatg 9120 
ctgcaggccg ttgcaagagc cgtcctgagg aaggggcggc aaacgggaga tggaagctag 9180 
agaggcagag acgtgtggga ccccaaacca ccccccaggt cgagccgttc ctagggccgt 9240 
gcacccccca gccaagtgca ccggagcccc cgcacgctcc cgggagagac caggagccat 9300 
ggctcccgca cactctagga ccacctccag agaataccac gagcgaaggt gaaatctcac 9360 
accctcaagt cgagccccag gcccagtgca cactgcacgg cctcgggggc cagacccagc 9420 
tggctcacct gatggacggg gaggtgggag acataggcgg cagggtcgga ggcgcgcaca 9480 
gcgttcgcca cctccataca gcaggccagc gtctgccagg gttcctccgc gcccatccac 9540 
cactttcggc cctgcgggga cagcggatgg ggggcagtga ggcccgggcc cgatccctga 9600 
gcccgctggg aggctgtgtt gcggggaggt gggaaatggg gaggagacgc acacccgtga 9660 
tagtgaacac gggacgcatg tgggcgagag acggggcggt ggctggatga gttctccata 9720 
gccacggatg gaggatggga gctgcgggtg gaccgggctg aaacaagcgt gtccggagct 9780 
gccgggggag gagggtggac agaggacctg ggggcgccgg gggaggaagc agctcggcgg 9840 
atgcagggga ggggggaacg tggggaacgc gggggccctg gggcagggga gaagggagaa 9900 
gcaggacggg cagggggcgc gggggaggag agcgggcggg ggacgtgggg gcgccagggg 9960 
agggggaggg gaggaggaag acgggcaggg ggcgccaggg gagggggagg ggaggaggaa 10020 
gacgggcagg gggcgcgggg gcgccggggg agggcgcggg ggcgccgggg gagggcgcgg 10080 
gggtgccggg agggcgggga atgcgggggc cccgccccta ccgtcaaggg ttggtccgcg 10140 
gagtccagga tgtcatccat cacctcctcc gcaaaggcca ggcgcttccg cagcggctcc 10200 
cgcttcttca accccgtgag attgaccagg tggatcttga gccaatccag gccgtgcggg 10260 
ccgagcgggc ggccctgggc gaactccagc agggcccgcg ccacgtcgct gcccaggtgg 10320 
ttgaagtgcg gcgggcaggg gtaggtgcgg ccgcggaagt ccatgttgtg cggcagccag 10380 
aagacgcggt cccgcaggtg ctgcgccagc gagaggcggt acagcgcctc cgcccgcagg 10440 
ctgtgcatct cccgggccac cttctggcag tgcgccagct cacggcgcag ctcggccttg 10500 
cgggcgggcg cggcgctgtg cggcaggtgg gcctcgggcg gctggggcgc ctcggagggc 10560 
ggggccggca cgcctagctg ggggcagccc ttggcctgga agagctgcag caccaggtcc 10620 
agcacgcgcc cgttgacgcg ccaggcgcag ttgcccagtt gggtgagggc gtccagtgcg 10680 
ccatgcagcg cggtgggcgg gcaggtttcc agcagctcct ggtgctgcgt ggcgccttcc 10740 
accgtgcgca tcagcttggt ggggctgagc aggaaagcac cagagtgcgg cgatgtccag 10800 
ggcagcgggg ggcaaagcat gggtacatcc accgcctcga aggtcagcgt gggctccgcg 10860 
gccttctcca gcagctgcac gtaggccggg tgcggcttca ggatgccgat ctggggtgcg 10920 
acaggcagac gggtcagggc cccggtgctg gggctttcct gttcccaccc cttaaacttg 10980 
ggtgagaggg gccggctccc cggccaacaa gaaaccagtg tggcctccca cgaacagaag 11040 
ccacctccag aaacggccgg acacctgcat ggacacccat ggtgtgtccc gagtcctggg 11100 
aggtactgac ggctgcgctg agatcaaggc tccgcccaaa ggcgccaacc ccatggggtc 11160 
cctggtcctc ccagcgggat gccccccagc tcaggagggc actgcctggc acctgctgga 11220 
cgttgcggaa ggaatacacg tggtagagca cggggacaag ccgagaggaa cgatgcggct 11280 
tgtccaggct gcatggcatc tgcgtagcct gcaccagcat ctccgccagc agcttgccca 11340 
gctccatctg cactggcagg ggccagggct gctcccgcag ggcctcgggc gcccccagcr 11400 
cctcccagta ctgccgcggc aggcagggct cgggcacctg taggacaggg cggtcagggc 11460 
gctgggcacc ggggcccctg agctagatgc cccaccgccc gtgcctgacg cccggtgggg 11520 
catctgtcag cccaagcata cagatgaaca gactgaagct tgggtgcaaa cccggctgct 11580 
ccagggaggg agagcgccca cccaccactg gccccagcca ggaggagagg gggtgcgagc 11640 
ctcacctcgg cgtcggaggc cagcaagcag aggtacttcc tgtagtggtt ctgcagcgcc 11700 
tgcacctggc cactgacccg ctgcctctgc accacgtgcc ggctgaaagt gcgcgcactc 11760 
agctcccggg ccagggtggt gaaggactca ccttgggcgg gcagcgcctg caggacctgc 11820 
ggaaggcagc cgtgagtgcc tgcccgcccc gcccggggac ccggccgcgc ggaggaagac 11880 
gcacctgcag gagcatccgc accacctcgc gctcgtccag caggcacagg aaggggtaaa 11940 
gtgagaaccg gccctcgtac acctcgcgct ctaggcggtt cttggtctcc cgcagcgccG 12000 
ggcacagtgc tttctcccat tggtcccgca gggtcttcag ggtcttccgc tgcgggggat 12060 
gaacgggccc ggtgagcccc gtggcagctg gtgggaccca ggctcacagg acgggggtca 12120 
ccgcagctcc ctgcagagac ctcatggccc tcaaggtccc tgctgtgtgt tccgggtagc 12180 
tcctcacccc ggcctgccct ctgccggctt cagcgtgcct gacgcagcca agagcaaaag 12240 
cccagctgca gtgtgcgcag aagcacaggc caagacccaa cctcgggacc ccacaagttt 12300 
tccctgagcg gcagccaggc tgagttccta ggccctgcat gaccagacca gggcatgagc 12360 
aattcaaccg catacacgga gctcagcccc tgcggcggac acgcgacccc ggctcagccc 12420 
ctgcggcgga cacgggaccc cggctcagcc cgtgcggtgg acacgcgacc ccggctcagc 12480 
ccctgcggcg gacacgggac cccggctcag cccctaccgc gtgcttgacc tccttgcttg 12540 
gcaacgtggg cttctccacg gacaccacgc acaccctgct ggccagctcc atgtggagct 12600 
gcttctcaaa gaggcactgc agggtcttca agggcaggtg cagcttcggg taggacacac 12660 
gcccatcctg cagggatggg ggtagtgagg ttgggggctt gccagagggc gacctgccct 12720 
cccaggaccc cgagacagca tgggtgcacg cgtttctgcg tctcctgcaa gttgctggtg 12780 
gctatcgctg acgcggggaa aggcgggctg cgggtaaagt cagtgccagc agtgcaaacc 12840 
aaaggccttg accctcctgg cctcgacccc tctagaaggg acactgggca ccgtgcaggg 12900 
TOtggcaggg gcggtgatgc tgggagctgg cagagcctgg ggagaccgtt cactgcaccc 12960 
ccagatgttg gctgttttct cctcaaactc agaactgtat gaatgtgacc catccagaaa 13020 
tagatgaatt aaaaataaca actaaagcct agcgctttga gaatcaaaga cgcacgtcca 13080 
cataaaagct tgtacacaaa cgttcacagc tgcatgactc gcagtcgata agtagaaaca 13140 
gcccaacgtc ccataaacgg acgaacagac gggcacggcg cggccatcca cgcaccggag 13200 
catgactcag ccctgaccca ggtcgcctcc cggaggcacc atgaggacgt cacgctcagt 13260 
gggagatgcc aaacacaaaa ggtctcgcag tgtgtggtcc catttctatg gaatgtccag 13320 
agcagactca tccacagatg gggaggggat ggggagtgac ggggatgggg acgaggcttc 13380 
cttttagggt gatggaacat tctagaatta gacaaccgtg actacactaa aatcgctgaa 13440 
ttacaccttt aagagggttt tatggcaggt gaattacacc tcagtaacag acgagcccac 13500 
tgcgtgcacc tggcagcccc actcaaacgc actgctctcc tgtcacccca ccctctctct 13560
gcggcccccg accacctcgt ccccctgagc ccacaccctc agggccaaga ccctcccagc 13620
tctgggtcct cccatcttct cagaggagga agggaggaat tcagggccca gcccaggtga 13680
gccctgggca ccggggaggc ccattggtct gagctgaggc tccaggaacc cccaaagggc 13740
agctataagg actgaagtct gccggggccc acgtgggctc accttggcat acacgtccct 13800
gagcagcttg gaggtgttga ccgggggcgg cagctgcggc gggsggctga aggtgggctt 13860
caccttgtgc acggccttca gaacagtggc ccgatcctcc tcagacagca gaacggcggt 13920
gaagagtgcc tgcagcttca gcccctcctg gctcatctgt tccagacacc tgtggtgcag 13980
gcggcctgct cgagggacgg gccagcccca cgctgggctt ccacagaccc caggggaacc 14040
tcgtgaccac ctcctgctag cctgcaggtc tcggtgtggc tgtcaggccc tctgggggtc 14100
cccagccGCC agcGcaggca ccgtcccaga tcttaaaacc ctgggaggga catggtgggg 14160
ggtgggggcc ctcccgacac cacctacctt tcgatggtcc cggcgtcctg gtcctgcctc 14220
cccatgcact ggagggcagc cgcataggac agcaggtccg gagtcaagcc ggcatccttc 14280
accatgaata acacatatac cagctccttg aaggcaccct gggagaccaa gccagggtga 14340
gggtctgggg ggatggccca acctccacat cctccctgct ccctggagac cccttctctg 14400
tagccaccag ctcagcaggg gacagggtca ccaggcagga gtggccagct gggcagaccg 14460
atgcatcccc ctgaggttct gacacacaag ctccacctgc agaggcagcc gcatggcccg 14520
ccaggtggga ctgtgggagg ttcacgttcc tctgggaggc agcttgttaa acctccagat 14580
ttgtcaattg tgtggatctt ttcaaaggac tgacttggct tgactgttct ctgctgtttc 14640
tgccttccat ttcatcgatt tgttttaatc tttgtaactt cctctcatct acttgcttta 14700
ggtttagtga cagcttcttc ttctagtttc ctaaggtgaa aggtgacgta tttggtctga 14760
gatgtttcac tttttttccc cccaagatgg agtcttgctc tgttgcccag gctggagtgc 14820
agtggcacaa tctcagctgg gccgggttct ctgcctccca ggttccagca cttctcctgc 14880
ctcagcctcc tgagtagctg ggattacagg cacacgccac cacaccagct aattttttgt 14940
attcttagca gatacggggt ttcaccatgc tggccaggct ggtctcgaac tcctgacatc 15000 
gtgatccgcc agcctcagcc tcccaaagtg ctgggatgac aggtgtgcac caccgcgccc 15060 
ggccatcacc tttccgaata taggcatttt gtgactataa attaccctgc gagcactgtg 15120 
tcagctgcat cccaggactt ctgacaggtg gtgttttcat tttcattatc tccaagtgtt 15180 
ttcgaacttc atagtttact tcttctttgg aaattttatt taattatttt tttagataga 15240 
gtctcgctct gtcgcccagg ctggagtgca gtggcgcaat ctcagctcac tgtcaacctc 15300 
cgcctcccgg gttcaaccga ttctcctgcc tcagcctcct gagtagctgg gactacaggc 15360 
acatgccacc acacccagct aattattttg tatttttagt agagargggg tttcgccctg 15420 
ttggccaggc tggtctccaa ctcctgacct caggggatcc acccgcctcg gcctcccaaa 15480 
gtgctgggat tacaggtgtg agccaccacg cccagccatg tatagcttaa atatcccctg 15540 
caattttttt ttttttcatt taatttttgg ccaggcacag tggctcatgc ctgtaacccc 15600 
agcactttgg gaggccaaga caggaggatc acaaggtcag gagtttaaga ccagcctggc 15660
caacatagtg aaaccccatc tccactaaaa atacaaaaaa aaaaaaaaaa aattagctgg 15720 
gcgtggtggc tcatgcctgt gctccctcca ctaaaaatac aaaaaaaaaa aaaaattagc 15780 
tgggcgtggt ggcacatgcc tgtaatctca gctactggga gcctggggca ggagaatcac 15840 
ttgaacgcag aaagcggaaa ttgcggtaag ccgggatctc accactgcac tccagcctgg 15900 
gagacagaaa ctttgctgtc gacagacttg gagactctgt cttaaaatat acacacacac 15960 
acatatatat atatatataa aataacatat atatataatt tttttcttgt attcattttt 16020
cctgacatcc ctgttctgag caatttctcc tttgacccag tggcrgctta agagtggcct 16080
gtaactgtaa cagactattc caaagggaaa aaaattccct tacatcctcc caccccatag 16140
tcctgcagct gaagacatgc tgtgacatga ggtggccaca caccagagac cagagacatg 16200
agttttgggg catttttttt tttttttttt tttgagacgg agtctcgctc tgtcgcccag 16260
gctggagtgc agtggctcga tctcggctca ctgcaagctc tgcctcccag gttcactcca 16320
tcctcctgcc tcagcctccc aagtagctgg gactgcaggc gcccgccacc acacccggct 16380
aattttttgt atatttttag tagagacggg gtttcactgt gttagccagg atggtctcat 16440
ctcctgacct cgtgatccgc ccgcctcagc ctcccaaagt gctgggatta caggcgtgag 16500
ccactgtgcc cggccggttt tggggcagtt tctaaacaac ctctgtatgg tagacctcac 16560
tggccacaca tagtccttaa attgaaatat tcagtrcttc cctttcacca gcttcaagtg 16620
ttcagtagca cacacagctg ttggcagatg cggaaaattc ccaacatcat agaaagttct 16680
actggatggt gctggttaga atacgtggcc gggcgcggtc gctcacgcct gtaatcccag 16740
cacttaggga ggctgaggcg ggcggattac ctgaggtcag gagtttgaga ccagcccggc 16800
caacatggca aaagcccgtc tctactaaaa atacaaaaat tggccgggcg tggtggtgag 16860
tccctgtaat cccagccact caggaggctg cggcagggag aattattgaa cccaggaggc 16920
ggaggctgta gtgagccgag atcatggcac tgcaccctag cctgggcaac agacagagag 16980
tctatctcaa aaaaaaaaaa aaaaaaaaga tagaagcaat gccttagcct ggctaacatg 17040
ctgaaacccc acctctacta aaaataaaaa ttaaaacaat tatccggggg tggtggcaca 17100
cgcctgtaat cccagctgct cgggaggctg agctcgcagt ccagcgacat ccaggactgc 17160
tggccacccc ggaacgctgg gagaggcagg aggggcccct gctagagcct ctggagagac 17220
ttcgggtctg cagacatctt gattccagac ttctgggctc gtgctaagag tgcgtttctg 17280
ctgtgcaagc cgccaggttt gggacacttt cgtaggggcc gatcccaaaa gcgccctgtt 17340
acagtgtggg ctctctgccc agggaatcca gggggcttgt gaccttggag gggaaaatac 17400
acgaccctca tcctcagtcc tcccggagtc tggcgccccc tgcagcaagg aggaaccagg 17460
cagcacgccg cctccacctc gcggtaagag cactgcggac ttcaccgcaa gactggcccc 17520
acctgatcct gaatttcgct gtttgatgcg ttaataaaga agcacatcaa gttctctacc 17580
acgaattggt cttaatattg cgatatctgt attttaatat aatagtatcc catgtttacc 17640
caaatattaa gagaagcttt tactgttgtt tctcaaatta gggctgaagg atcatggggg 17700
gggagaaagc tgggaacgtt tgctgctttg aaagggtgtg taaacaacac cctccaaaac 17760
aaccaagagt tccgaggaga aactttggcc ggatacggtg gctcacgcct gtaatctcag 17820
ctcctcggga ggctcagggg ggcagatcac gaggtcagga gtttgagacc agcttggcca 17880
acacggtgaa acccccgtct ctactcaaaa tacaaaaatt aatcgggggt ggtggcgggc 17940
acctgtaact ccagctactt aggaggctga ggcaggataa tcacttgaac ctgggaggtg 18000
gaggtggcca tgagccgaga tcgcaccacc gcactccaac ctagtaacag ggagagtatg 18060
tcccagaaaa caaataaata aacaaacaaa aagaaaacgg caagggaaat tggaaaatac 18120
tccagatgaa ccacaacgaa gatgggtggg atacatctaa agctgtgctc ^*?sigggaatg 18180
cggcgccagt gaacacccac atttcacaca gaaggatctc agcacagcag cccgaccttc 18240
cacctcagga aaccagaaaa aggagcaaag tcaaccccaa caccaaagcG tcatcctgac 18300
gagggctctg caggctgccc cccgacgagg ccaaaagcac ccctgcccag acagattcac 18360
gagccccgag aaagaacgga aggaaatgct caaggcatta gcagaatttc tccctacttt 18420
tttggtcatt ttcaaaattt gagagtcaca cgtgatttgt atttgaaaag cctaaaagaa 18480
ttattaaaat aaaaaacaaa ggacttgaac ctgggggcta agagagaaaa gtccagtcta 18540
aatgagggca agttcctgtc tccaacgacc agggcaggtg gcccggctcc cggctgcact 18600
cacctgccgc gcccagccaa gcatcacggc gttgtacatg tccagcgtga gcagcttccg 18660
cttctgccgc tggccgtggt ggacgaccag caggtggtgg gcgaggggca gctggtcagt 18720
gagcaggcag cacttgaaga aggccaggag cctctgctgc tgacctgaga gctgggcctg 18780
cgagtgctgc cccgacgggg cctgctccac atcgaggctc agcttcccag gggcctcctg 18840
cagcagccgg gccagctgct cctcccaggg gctctcgggg gcctggcgcg tgcagtcctc 18900
caggcacccg gccatctgct tgctcaggag ccggggctcc acctgcaggc gcctggtcag 18960
cgccttgaac tccccgctct ggaatggcat ctgcagcttc gccttcaacc gctgcatacg 19020
catctgctgg gtccgcttat ccttctccag tatctttgcc cagcggccac agggcaccgg 19080
ggtggcatcc ttggccccca tctggacctt cctgggtggc tggaggctac catctccact 19140
gccacattct gggagccgcg ccacatccac cctgttcacc accacctccg acacgctctc 19200
agcctgcagc tgccgcaccc gcgcctggag cactgtgagg ggcagaaggc gaggacatga 19260
gagggacccc ctccccattc gagcacccgt ctctctggac cctgagccag gccaggaggt 19320
gcaggtggct gagctcgctg ggacccaagg cgtgaattcc tcatacttgc caacaacgtt 19380
gtaaggtctg cccgctgctt tccagacaca cgcaccccac cacctccgca cctccccacc 19440
cgagcctcac agaactcagc agccctaaca agctgccacc gaaacctgca gcaccacgtc 19500
tccccggtca ctggccgctc agaccctcca ggtgcacagg cccagaaccc ggggtctgtg 19560
acaactccct ccgtccacct ctcagtacct cctctgggct tgcctccaga atctatccag 19620
gtggcccccg cctcccctgc ccctctcact gtctagctca gggcctctgc acagactccc 19680
aggaccctga accgcccact ccctggctca accatggcct gcaagttcgc accccgcctc 19740
agcaagaccc ccccagctgg tggagctgcc acacacacac tcctaggctc ccagtgtcta 19800 
caccggtgga cgctgagcca ctagctcgca gggaaaacgc ggctcctgct cgtgccgccr 19860 
caggttgcat ttttgccaac caatcaatgc ctaagtgttc tgtatctctt taaagaagcc 19920 
ttgttggaaa tctattgctg gccgggcatg gcggctcacg tcggtcatcc cagcactttg 19980 
ggaggccgag gcaggaagat cacctaaggt caggagttcg agaccagcct ggccaacatg 20040
gtgaaacccc gtctctatta gaaatccaaa aaattagctg ggcgtggtgg catgtgtcta 20100 
tagtaccagc tacttgggag gctgaggcag gagaattgct tgagcctggg aggcagaggt 20160 
tgcagtgact caagatagcg ccattgaact ccagcctggg caacagaaca ataatccatc 20220 
taaaaaaaaa agactgttga aataagccgg gtacagggcc gcgcacctgt ggtcccagct 20280 
actccggtgg ctgaggtgaa agaatcacct aagcctagga gttcctggct gctgtgagcc 20340 
gtgatcaggc caccgtgctg cagcctgaga gacagagcag gaccctgtct caaaaaaaaa 20400 
aagggggggg gggacccagg tgtccagatg tggtggctca cgcctgtaat cccagcactt 20460
taggaggccg aggcaggcgg atcacgaggt caggagatca agaccatcct ggctaacacg 20520 
gtgaaacccc gtccctacta aaaatacgaa aaattaaccg ggcgtggtgg tgcgcgcctg 20580 
tagttccagc tactcgggag gttgaggcag gagaattgct tgaactcggg aggcggaggc 20640 
tgcagtgagc caagatcgca ccattgcact ccagcctagc aacagattga gaatccgtct 20700 
caagaaaaaa aaaattgctg aaataaaaag acaagcgtga tgtccgcctt cagagtgctc 20760 
caaaactcag gagatacttt taggattaac agttgagagc tttgttttgt tttgttttgt 20820 
ttttgagatg gaatttccct cgttgcccag gctagagtgc aatggcatga tctcggctca 20880 
ccgcaacctc caccttccgg gttcaagcga ttctcctgtc tcagtctccc cgggttcaag 20940 



cgattttcct gcctcagcct cctgagtagc tggcactgca ggcgttcacc accatgccca 21000
gctaattttt gtatttttag tagagacagt gtttcaccat gttggccagg ctggtcttga 21060
actcatgacc tcttgatccg cccgcctcgg cctcccaaag tgctgggatt acaggcgtga 21120
gccaccgcac caggcctcgg acccttgacc tcttgatccg cccaccttgg ccacccaaaa 21180
gtgctgggag tacaggcgtg agccaccgca ccaggcctcg aacccccgac ctcttgatcc 21240
gcccacctcg gccacccaaa agtgctggga ttacaggcgt gagccaccgc acctggccag 21300
gttttttccc tttataaagg ttctcccgcc tctcccttcc cggctgccta arggacgcag 21360
acaggatgtg ggacagaagc accggcggga agcaagcaca gggaagctcc cacctccctc 21420
ccacaccacc agccaggcca ggacgagggc ctgccaccgc tggagcctgg gctgtccctc 21480
ccaagtttcg cagtcatcca gtctccatta ggcgcctacc ccccagagcc aagccaggac 21540
agctgagtca gttcagggtt cacatcctgg ctctgcacat gtggccttgg cggcggggcc 21600
gggggggggg tctctccaga cataatcttg ggcctcacct atgtccct^^g aaagtgggag 21660
cacctggtgg ggttctgggg agggggaatt acgagagctc caggaaggag cctgctcagc 21720
aaggacaggg cccatgagcg gtgcaagaga tgtttcagca acgccgtctg ggcgtgtcct 21780
gggacccgag aggtggagac cgccctcagc ctgtctcaga atctgagcct ttgccttttc 21840
tcccggcagc agggagcgga ctctcctctc ccgggccgcc gtgggggtcg cgctcaccct 21900
ccagcagctc cacgtggccc cagtccttcc tgcggtcttg gtcttgctcc tgggggctgg 21960
cggacgagct cctcctgggg ccgcagacgc caccggcggt ccctgcggga aagacgagag 22020
cggctgagcg gggccgggcg tgtgggcggg ggcctccata aaggcagaag ccgaagggtc 22080
gaagggcaaa ggagccctaa acgcagcgga aactctcgga gcacgggctt aagttggaaa 22140
gaaactaaga cagcgaaggt ggaagggccc cgccgcggcg aacacgggcg cggaaccgcc 22200
gagagagggt tcctcgcact cgaggtgcag caggtcaaag gttaagagcc ctaaacacca 22260 
cacctggggt caggaggctg cataagaaac cacgagtcaa aggtcagact gcacggagga 22320 
gcctcagtcg aaaagcgggc aagggcgagt ggaaagcggg gccgggtcgg tgggctgcgc 22380 
acgcccaggt gcaaagaggc aaaggtcaaa gcgccaaagg ccccggccgc gcggggagga 22440 
gcccacgccg tggcccccgg gctgcctggc cgtctccctt tgtgttacct tctttgccgg 22500 
ggagtCGcgg gcggccgcaa ggccgtaggg ctcgtttgag ccccgccgct ccgcggcccc 22560 
agcaaagtgc cgacattacg cacgccgctc caggccaccc caccggcccg cgcctgcgca 22620 
tgcgcccgcg ccgcctgccg ggagttgtgg tttcatggtc gacggaggct gcgaagggaa 22680
accccagccg gaagtagact cccaggatgc agcggaggcg cgaaggcatg cgccggtgga 22740 
cgctctgatt ggttcctcct gctgttttta aagggagggg gcgggacaga gctgttgccg 22800 
tggcaactgg gaggcactct caggctgttt tcccgaggac ctcaaatccg gacttttttt 22860 
ctgtttttct ttcttttttg gttttgtttt ggacgcgttg tggcccaggc tggagtgcag 22920 
tggcgtgatc atagctcagt gcagcttcga actgctgggg taaagagatc ctcgcccctc 22980 
ggcttcccaa agcgctggga ttgcagacgc cgccaccgtg cccggctttt tttttttttt 23040 
tttcaaggca tactcatcta ataacgagga cagcatctgc aatttagaga ttcctgtccg 23100 
caaccttcat tgctccaacg acaacttttg ggtaagagtc attaggatgc cgtctatcat 23160 
ggaggaagct gaggctcaga gagggccacc aagttgctgg aagacacagc acgtgcgacc 23220 
tcagggaggc tgcaaggaga gaaagcccca gtccgcgaga ctcccagcct ccagcttcag 23280 
tttaccctcc aatccccaag ccctcagggg caggagccga atggagcggc aggcttggat 23340 
tcacctgcta agtggggtga ggtcaaggga atgaaataaa cctcggagcc tagagcctgc 23400 



cctggtctcc gcgtgatcct gcctaggagg agcagggcgg gagctttaga atggaacctg 23460
gaaggtgtgc ccacctgtgt cgttcagccg gggcagcagg ccagaggcgg gagcgcctgc 23520
tgtggggcag taggcttggg aagggtgaga ataggaatat ctgggggtaa ctgtgttcca 23580
ggctaatatc ccagttgcaa aggggagctg gtttggtggc tcaggcctgt catcccagca 23640
ctttgggagg ctgaggcggg cggatcacct aaggtcagag ttcgagacca gcttggcaaa 23700
tacgcaagca tgcctggcaa catggcaaaa ccccgtctct agtaaaaata caaaaattat 23760
ccgggggtgg tggcgggcac ctgtaatccc agctactcgg gaggctgagg caggagaatc 23820
gcttgaaccc gggaggcgga ggttgcagtg agccaagatc tcgccactgc actccagcct 23880
gggtgacaga gcgagaacct gtctcaaaaa aaaaaaagtg caaagggagg tcagttcagt 23940
gcctcaggcc tgtaatccca gcactttggg aggctgcggc gggaggatcg cttgagccca 24000
ggagttccag acaagccttg ggcaaccgag atactgagac ccagtctcca ccaaaggaaa 24060
aaaagaaatt agccaggcat ggtggtgcac acctgtggtc ccagatactc gggaggctga 24120
ggcaggagga ctgcttgagc ccaggaggtt tagactgcag tgagctgaga tggcgccact 24180
gtactccagc ctgggttgac agaacaggac cctgtctcaa aacaaaacaa gtgcaaaggc 24240
cctgaggcag gaacaagcgt ggacagagga gcaatttgag cagagtgggg ctggggagag 24300
ggagcaaaga tgtagctggg gctcagttag ggggcctgac cacacggggg ctcgggggcc 24360
tcagctcaag ctatcctcca tccccaaacc ctggcacttc agtttcccca tcagcccaga 24420
acgaggactc gacctcactc tggaagggcc tggcagcctc cttacagcac attccagacg 24480
ctgctgccga cgcctgcgtg agcgcactga tgccaccggc tgggaatgtt ttcgacagac 24540
ggcagcaccc tccctcacct gcctcagtcc acctcagggt gccccagcgg gctgtgacct 24600
cagacctcac ccactactgg ggtcacctgc ctggccctga atcagccagg cctggtgtgc 24660
caagacctac agacaccccc tgcacccctg caggctggca gagccagaaa cttgggtgga 24720 
aaccgacttc tgaactattt caccattcct tatgcgttag tcttttcttt tatttgatga 24780 
gatcccagca ctttgggagg ccgaggcggg cggatcacgt gaggtcagga gtttgagacc 24840 
agcctggcca acatggtgaa accccgtctc tactaaaaat acgaaaatta gccgggcatg 24900
gtggcctgtg cctgtaatcc cagctactca ggaggccaag ggaggaaaat cacttgaacc 24960
tgagaggtgg aggttacagt gagccaagat cgcaccactg cactccagcc ttgggcaatg 25020 
tagccaaacc ccatcactac aaataataca aaaaaatttt gttggctgtg atggtgcctg 25080 
cctgtggccc catctacttg ggaggctgag gtgggaagat gtagaattgc ttgagccagg 25140 
aggcagaggc tgcagtgagc tgtgattgag ccactgcact ccagcctggg cgacagagcg 25200 
agaccctgtc tcaaaaaaaa aagaacataa tctgggtttt ggaataagac agcagtttct 25260 
gaaacagctc attgcccaaa ttccagcctc gcaactctgt agccgccacc accccccagc 25320 
cccaccattt attttaacta catctgtctc caccactcct gtattaagta aatgcaatat 25380 
tggctggtca tggtggctca tgcctgtaat tccagcactt tgggaggctg aggcaggcag 25440 
atcccctgag gtcaggagtt cgagactggc ctggccaacg tggtgaaacc ctgtctccac 25500 
taaaaattca aaaattagcc ggacgtggta gtgggtggtg cctgtaatcc cagctacttg 25560 
ggaggctgag gtaagagaaa tgcttgaatc caagagactg aggttgcagt gagctgagat 25620 
ctcgccgctg cactccagcc tgaacgacag agcgagactc cgtctcaaaa ataaattaat 25680 
aaatacaaca ttaattattt ttcttgctta agttttacga agagacttaa tatcaccatc 25740
aaaagtggga aaccatatat ctggccgggc gtggtggctc ccgcctgtca tcccagcact 25800
acgggaggcc gaggcgggcg gatcccctga ggccgggagc tggagaccag cctggctaac 25860 



atggtgaaac cctcatctcc aataaaaata acaaaaatta gccgggcatg gtgggtgcct 25920
gtaatcccag ctattcagga ggctgaggca gaagaatcac ttgaacccgg gaggcggagg 25980
ttgcagggag ccgagatcac accactgccc tccggcctgg gcgacagagc gagactctgt 26040
ctaaaaacaa aacaaaacaa aacccaacca agcaaacccc acagagtcga gaatcgctag 26100
atggaagggg atggcccagg tccctggagc ccctgtgaca aattaccaca aactcggtgc 26160
cttaaagcaa cgttcatttt cttacatttc tggaaatgaa aagtccaaaa tcaggactgc 26220
ggggctgaag tcaaggtgtg tggaggcctc gctccctcca gaggccctgg ggctccrtcc 26280
tgcctctccc agcttttgaa ggctccaggt gtgcttggcc tgcggccaca tcactcccgt 26340
ctcggtctct gtggtcgcac tgcagcctcc tcgtctgcct gtgtgaaatc tcctcctgtc 26400
tccgtattgt gaccgcgttt aggatgcccc aggacaatct tctccatatc gttcagatct 26460
tcatggtgtc aatatattga gactcttttt ccaaataagg caaatgtcac attctaggga 26520
tcagggtggg gacttacctt tgggccaacc acagaggcta caaagaggaa gacaccactc 26580
aatacaaagc gtgcgccagc ccagccctga tcggtgtttg ttgttgttgt ttttgtttga 26640
gacagagtct cgctctgtcg cccaggctgg agggcagtgg catgatctca gctcattgca 26700
acctccgcct cctgggttgt atagattctc ctgcctcagc ctcctgagta gctgggatta 26760
caggcgtgaa aaggagcaag gctctgcccc agccacagcg cggatgcacc ttgaggatgt 26820
catgctcagt gaaagacgcc agacacagaa ggacacacag tgtgtgatcc cctttatatg 26880
aaatgtccac aacaggccca tccacagagg caggaagggg atgtgtgggt gccgggggct 26940
ggcagagggg atgagtgaca gctgatgggg cttcttcttg cggtgatgga atcttctgga 27000
actagacagt cgtggtggtt gcacaactct acgaggtact aaaatcactg aactggctgg 27060
gtgcagtggc tcatgcctgt aatcccagca ctttgggagg cagaagcagg tagatcacga 27120
ggtcaggagt ttgagaccag cctggccaac atggtaaaac tctgtctcta ctaaaaatac 27180 
aaaaattagc tgggtgtggt ggcaggtgcc rgtaatccca gctactcagg aggctgaggc 27240 
aggagaatcg cttgaaccag ggaggcagag tttgcagtga gccgagatcg caccactgca 27300 
ctccagtctg ggtgacagag ccagactccg tctcaaagaa ataataataa aataaaatca 27360 
ctgaactgta cagtgtaagt gggtgaattg tgtggtatat gagtgatgtt tccgaggtgt 27420 
cattaaagaa actcagacgc ctggggtggg gccagtctca ccgctgtggg tcccatcccc 27480 
atcatttctc acaaggccct cagatcaccc ttccgcggtg gggggcggac actctaagaa 27540 
gggaagacct gggctcctgc tggcgagaag gcggtggaca tttcttcagt gtctggtgcc 27600 
gcgccctctg cccagcgtgc tccgtggagg gtctcattgt cttcctccag acgtctcttt 27660
actggcccat tttacagagg cggaaccgaa gcttggggtg ttggccacag ggctctagtg 27720 
tgggaagcca ggccaggctg gacctcagcc atggggaccc ctgtccctga gactgtggca 27780 
cctgccacac cctctgtgtg acccgcctaa gccaggaaga gagggtcagg agatgcctga 27840 
gccaccaaga aggcatccca gcgtccagcc agaccggtta tccctccaga gggctccccg 27900 
gcaggacagg ctggtcgcca tgtcttcagc ctggtgctat ttaaaggtgg gtgccacctg 27960
gggctgtggc cgcagggcca ggactgggct gctgggagct gtgtccccac agcggaggtc 28020 
gccgcccctc tcaggcctcg gtttccccag ttgtcaatgc ctccacttgg ctgtgagtct 28080 
gtgagggtca ctgtgctcac cttttggggc ccagcgcatg gggcaggcag aggaagggtg 28140 
ggggccagcc gccttgctgg gtggttcccc gtggggcctg gggtatggct ctaagggagg 28200 
agcaagtgtg ggtgcgaatg gggccgcccc attcctgccg cctccgacgt gccccgccag 28260 
ccggccaccg acaggtctac gtggctatcc tccctcctgc ccacctacct gcccaaacac 28320
acgtccccag tcgtcacctg cccacccacc cgcgcattcc cacacccttg tgggcctggc 28380
tttcgggaaa ctacaatttg cggggagaga agtcccacga gggcatgccc cggagcctgg 28440
ctggtcccac ggctgacgca cgcggcagga cctcccgtgt ccatctctgt ccccaagcat 28500
ctccgcctct gcccctctct gtctctgtgt ctctctcgtc tctcccggtc atcttccttg 28560
tgtctcttga ctgccgccgt ctttctgtct ctgtctccct ccgggtctct gtctccctcc 28620
aggtctctgc ggcccgcgtc tcacactccc gcccccgcaa cccgaggtcc tagcccgccc 28680
ggggactcgg ctgactcacg gacacgcccc gcgagacaaa caacaaacgc gcggaggccg 28740
agcgcggagt cccgcacggc cgcgcccctg tgcacctggc ccccgccccc gagacgtccc 28800
attggccggc gccctagcct ggtcccgccc aagtggaccc cgcccccgcc ccgaggcacc 28860
ccattggccg gcgtccccgc cccagcgaac ccggccccgc ccccgaggcg ccccattggc 28920
cccgccgcgc gaaggcagag ccgcggacgc Gcgggagcga cgagcgcgca gcgaaccggg 28980
tgccgggtca tgcgccgccg cctgtggctg ggcctggcct ggctgctgct ggcgcgggcg 29040
ccggacgccg cgggaacccc gagcgcgtcg cggggaccgc gcagctaccc gcacctggag 29100
ggcgacgtgc gctggcggcg cctcttctcc tccactcact tcttcctgcg cgtggatccc 29160
ggcggccgcg tgcagggcac ccgctggcgc cacggccagg acagtgagtg cggggcggcg 29220
ggggcctggg gtggggaggc ggcgggtgac ggcaacgcgg ccgccgtctt cacggtgacc 29280
tgcgcccgcg ggggagtccc ggaggctcct ctgtgcagcc tcggcctcag tttccgtggt 29340
ctgtgagatg ggtgcagcct gcctggtggg agggttgcac tgttaaagcg aaggctgcag 29400
cggcggaccc ggctcagggg cagagaagcg tccgtgtggt acaaccctgt gggtggggcc 29460
acccatctgc aggtgggaaa ctgaggctcc agaggggctg gggcaggccc agctgcatgg 29520



cggaagcggc ggggggctga cctccggact cctgacatca cagaatccag tcagggctgc 29580
ctgagtcggg gccccctctg cttcttccca gacaccccat ctggcaggtg aggacaagga 29640
ggcacacaga agggatggga cctgcccagg gtcacactga caggggtggc ggagctgggt 29700
ccccacaggg cccaggacgt cacggagcgg gcgtctctgt ccccagggtc tgccgagcac 29760
actgaggtag gccctcagtg tttgtggaat gtcaggagca agaggagagg ctgggcacag 29820
caggggatgt gggtacctgg aggccagggg agtcggtgtc cccgccgggc ggggggcact 29880
gggaaggggg cccgggcccg ctggctgccg cctgaatcac caccatcagg gcaggtaatc 29940
accccctgtc cttcccaccg ctttcatctg ggcgccaagg ccctcattag gccgcacgtg 30000
acgagggcgg acaggggact ggctgggccg gtccatccat ggcgggcatg gccaggcggg 30060
gtggcctcgg gccggggcag aggcctggct ccgctgcctg acctggaaca gtctctgcct 30120
ctctccaagc ctcggtttcc ccagctggac ggtgatgggg gtgagggcta gctgagggct 30180
ctcctgccct tcgtgcattc gctggtcact aatcgggcac cttgtgggtg ctgtgctccg 30240
catgggggac ccagtggtga cagagacgcc caccctcctg gggctcccag agcagaggcg 30300
cgcagcagtt agacacgtga acaagggcgc aggtgggtgc acagaacagt gaacggttgg 30360
ccgggtgcag tggctcacgt cggtaatccc agcactttgg gaggccgagg cgggcagatc 30420
acgaggtcag gagatcgaga ccatcccggc taacacggtg aaaccccgtc tctaccaaaa 30480
atacaaaaat tagccgggtg tggtggcggg cgcctgtagt cccagctact cgggaggccg 30540
aggcaggaga atgacgtgaa gccgggaggt ggagcttgca gtgagctgag atcgcgccac 30600
tgccctccac cctgggcgac agagcgagac tccgtctcaa aaaaaaaaaa aaaaaaagaa 30660
cagtgaatga cgtgaacaag ggtgcaggtg ggtgcgcaga acagtgaacg gcggtgttgg 30720
gaggcacctt gccaggggag gggaggtgca gggcgaggaa ggggccaggg gagatcgtga 30780
cacagacgcc ccagaacaac cacctcaaag acgttcctgt gtgtcctgga aggtcgggct 30840
gggaggctgc cccgaggagc tttcactttg acagggagct ggccgggcac gcagggaect 30900 
gtacacccag ctgacaaagc ggcagacacc caggccgggg tgagcgagtg tgggtgagga 30960 
gtggcggctg gccccagggt ccttgctgga caagacactt cagctcaggg tggggcaggg 31020 
ctcacccagg gctacccaca gacgatggcg tccaaatctg gctctgccac tcccaggcct 31080 
caactggccc ctctgcaacg tgggctgctg agcgggcttg gtaggacagc tggcatacag 31140 
tcggcgctca agcatgtctg tggtgtccca taaaccaccg gtgtcccact ctaggccact 31200 
gccagcccgg cctccagtcc agagtcccag tccggagtcc cagtgactgt gcgtgggccg 31260 
ggcagctgag ctgtgagggc cgggctgggg gctccatatg gggtggtgtg agctgtgagg 31320 
gccgggctgg gggctccata tggggtggtg tgagctgtga gggccgggct gggggctcca 31380 
tatggggtgg tgtgagctgt gagggccggg ctgggggctc catatggggt ggtgtgagct 31440 
gtgagggccg ggctgggggg tccctggggt ggtgtgagct gtgagggccg ggctgggggg 31500 
tctctggggt ggtgtgagct gtgagggccg ggctgggggc tccatatggg gtggtgtgag 31560 
ctgtgagggc cgggctgggg gctccatatg gggtggtgtg agctgtgagg gccgggctgg 31620 
ggggtccctg gggtggtgtg agctgtgagg gccgggctgg gggctccctg gggtggtgtg 31680 
agctctgagg gccgggctgg ggggtctctg gggtggtgtg agctgtgagg gccgggctgg 31740
gggctccata tggggtggtg tgagctctga gggccgggct gggggctccc tggggtgctg 31800 
ctggtcgctg gctcattgac agttatcagt ggtctgggtg ggccctgccc cttctgactc 31860 
ccacatccca ggaacccttt cccaaccttc ctcgtggtgt tgctgccccc ctgacgtccg 31920 
tccctctggg tgtgtgggag cccccccgcc atacacacac acagatgctg ctcttgggct 31980 



gagctgcagg 
ccccagctcg 
cacatgctgg 
gtggtcggga 
ccctgcacga 
gacccccgcc 
tcaaagcagt
cggtgagtgc 
cccccgcccg 
cggccacaac 
gctggacagg 
ccacttcctg 
ctgggctggt 
ctgagtgccc 
cagcctgagg 
ccaacagggc 
aggcagggcc 

gaagccggtc 
caggtgggac 
cagggtcctc 



gacagcgctg 
tgggggccgt 
ggggggctcc 
ggggactcgt 
agcacagcgg 
tcccccaggc 
gtcctcaggc 
cgggcagggc 
cagcgactct 
acctacgcct 
aggggggggc 
cccgtcctgg 
ggcgaggggc 
accgtgtgcg 
gggtggtggc 
gcctctcggg 
cggcagaggc 
aggggagggg 
ccgtcccggg 
ggagatggag 
acgacaggag 



acctggccct 
cctgcttccc 
tggtgtcagt 
ggtcccgggg 
acagcagcgg 
atcctggaga 
ttctacgtgg 
tgggcggcgc 
acaccgtgga 
cacagcgctg 
cccggccagg 
tctcctgagg 
ccggccacgc 
ggcgctgtgg 
caccaagcag 
ctgaaggacg 
aggggctcgg 
gccggggaga 
aatctgcaaa 
gagcatccag 
gcgggacagt
cccacggggt

gttccctctg 
cacggctctg 
gtctcctggt 
tgctgggggt 
tccgctctgt 
ccatgaaccg 
gggcagggtg 
ctgcaggttc 
gcgccgccgc 
cggccggacg 
ccctgagagg 
ttgttcttcc 
acacagccca 
gttcaatcct 
cagacgtcga 
ggtggggagc 
cggtgacaga 
tacaacgcct 
ggtggggggt 
gagagccaga 



cctcatcgat 
cctgctcctt 
ggggatcctg 
atctgtcgtg 
gagccagcaa 
acacgtgggc 
ccggggccgc 
gggagggtgg 
cgggagcgca 
ggccagccca 
cggcggtacc 
ccggcggctc 
ccctgcgggc 
ggagccctcc 
gagttgggga 
aaggtcgagg 
acgttgggag 
cgccgcagaa 
tgcgaggaca 
ccagggcccc 
gagagatggg 



ctctgcactc 
gctcctccct 
agtgtccgtc 
gtcctgaggg 
ggccctcccc 
gtcgtggtca 
ctctacgggt 
gccggcctca 
tcgaagagaa 
tgttcctggc 
acctgtccgc 
cccaaggtgc 
tctgtaagcg 
aggggggtcc 
cctcgaggac 
gggacgtccc 
tgggggcagg 
caccagcctc 
aaggcacctg 
agtgr,cctca 
gatgggccgc 



32040 
32100 
32160 
32220 
32280 
32340 
32400 
32460 
32520 
32580 
32640 
32700 
32760 
32820 
32880 
32940 
33000 
33060 
33120 
33180 
33240
gctgtggccg tgaaggggag gaagggccct aagctgaggg acgtgggtgc ctccagatgc 33300
tggggaaggc gggaacggtt ccgcactgga gcccccggga gggaccggcc tgctcctgcc 33360 
ttgatatgag cccagtggga cccagtttgg actctggcct ccagaaccgc cagaaaataa 33420 
acgtagtaag ccatcaactt tgtggtcttt tgttacagca gacgtcggaa atatgcacac 33480 
ggtgtctgaa actgttctca tgacaaaata agcctcagat cccccgggga agggcggagg 33540 
ccaacgcctc ggtgttcctc cgatcccccg ggaagggcgg aggccgacgc ctcggtgttc 33600
ctcggatccc ccgggaaggg cagaggccga cgcctcggtg ctcctcagat cccccgggaa 33660 
gggcagaggc tgagggcagg agccgtgctg ggtgcagggc aggcctgggg gcttcatgcc 33720 
gctgtcctgc gggacgcaga gagggctggc cgtcggtgtg ggggcgcccc cacctgtgcc 33780
cagcgccctc ctgacatcct gactccgctg ggacttctgc ctacagccct gggagtcaaa 33840 
ctccagcctc tcagagaaaa ggtcagagcc aagagcccca cagcctggag ccaggcagtg 33900 
acaccctggg cctgtctccc cttctgtgtg tggggcgaca gcagcatcgc cctggtgaag 33960 
tccccgggga cggccagggc tccatcccca gccgccgcct tccacataaa tacaggaaga 34020 
ctgggccgag gcacttgctg ggaggtgctg agcagcctga cacggaaaac ccttctggga 34080
agggagggtc gtgcccggcc cgagagcttc tgctcaccct gcagacagaa gcgagcccca 34140 
ccccagggga caccaggcgg cctctgggga catctttggc tggcatggag tgggtggagg 34200 
acagggctgc acccaggatg tccccaggtt ggcagtgtga ggggagatcg gcccacgttg 34260
gccagtcgga gggcgtcgcc acttgagttg tcactgggag ctgcacaggt caccacagct 34320 
gaaataaaac ttgctggcac cccacgcagg aacgtaacat gtgcctcgaa gaaacgggtc 34380 
agcaggccgg gcgcgggggc tcacgcctgt catcccagca ctttgggagg ccgaggcggg 34440



tggatcacga ggtcaggaga tcaaggccat cttggtcaac atggtgaaac cccgtgtcta 34500
ctaaaaatac aaaaaattag ccgggcgtgg tggcgggcgc ctgtaattcc agctacttga 34560
gaggctgagg cggggaatcg cttgaatccg ggaggcggag gctgcagtga gctgagatcg 34620
cgccactgca ctccagcctg ggcgacagag cgagactccg tctcaaaaaa aaaaaaaaaa 34680
aagaaacagg tcagcagttg tttctttgtt tctaaaacag agcgtggaat gggcgtacag 34740
ctccgcacat cccagggcag tgaaatcccg gttcacacag agccctcagc agcttattcg 34800
caagcccaaa cctggggacc cccgttgtcc tcaggcagtg aggtgggggc cccccaacag 34860
agaggagcgg cctgggggca cagaaccagc ggctccccag gaaatcgcca gcagtgaaaa 34920
taagacaacc ccaaactgtt gcaaactgtg cttccgctta cgaagcactc ctgagcggca 34980
gggcggatgg ggagagggcg gctgcaggcg cgaggggccc ggggacgcag gggtgcgggc 35040
cttaccaggg ccctgtcctg tcgtgcagca ggctcctggg gcagggaaga caccaggggc 35100
ggccacttct tactgctgtc tgacctcgag caatgcggcc tcacagcccc caccagggtg 35160
ccggtgtcct ctgggcccag cgcccccgag gctcatgcct gggtggggcg aaccaatcgg 35220
tcctgctcct ctggccactc cacgcgaggg aagtcccagc ctcacaggca ggcgcacacc 35280
ccggcagcat ctctgacaaa ggccctccag ttccgagtct ccaggtcccg ccgctgcaag 35340
cctcacctgc ccagccctcc tctccagctc caactccaac tcccaagaac caccacggac 35400
acacagaacc cgagccttgt ctccctcaac gcctcctgac tcaaaactcc atcttccaac 35460
aggaaaacgg ctcggccggg ggactgtgac ccggagcagg cggcccagcc tgtcgcgcag 35520
actcggggcc taaaacactt gttctctcag tccggagatc aaggacgatc cgaggtaacc 35580
tccctacctc ggtgtcctcc atgcaacctc gtcttagggc accgggtacg ttacctcgtg 35640
aggagccgag tccgcgggtc ctggggttga gatgtggacg ccctcagggc tggcactctg 35700
ccctggcggc cacagtcatg gaagtcccaa cgcttctctc ggctccgcaa ccccagaggg 35760
cggccacgag gagggcccgc cacgcacgac cccagagggc ggccaccagg agggcccgcc 35820 
acgcgcgacc ccagagggcg gccaccagga gggcccgcca cggcgttgcg gcagcagccc 35880 
agaaggtgcc ctgcgcacgg tccggacagg tgggatccga gttacctggc caagggggct 35940
gacgcagaca cgtcgcggga cacagtgaag agtgtggtgc agagcggagg gcgggagtct 36000 
ttggagaaca ggtaggggcg tggggcacgc gcctcccacg cgcaggagcc gtctaccgtg 36060 
gagggacacg ggtggtcctg ctggaggctc ctctccgtta gctgtctcca tcgtctgatt 36120 
cttggatccc aggatggtgg gatcatcagc aactgagatg aacccactgc cccggccccc 36180 
tgagcccgca ggtccccacg ccttgccagc tgtgcccgag ctggctgcac cccgggccag 36240
gcatccagca accttgagca gtggggtccg gcttttcaga aggggccagg aacccgcgtg 36300 
gctgaggtgt gaccgaagcg tggggcagag gcgctgggcc ctggcgcrtt aacgctggtg 36360 
tttctggttt taaatttcac gacccagtga cactgccacc ctgctacctc gccagcagcc 36420 
ctcctgggct taacttcggg agagcagttt tgctagccgg ccctgggtgc caagccctgc 36480 
aggaggcgca gacccctgga gacaggaccg gactctgcag agcccgacca gcctcccagc 36540 
ttggcctttt cctgacgcac gggcgcagaa ggaaagccac agcaccggct tctctttgta 36600 
agtagtgtat tttaaatagc tttcaagata cacatatttt ttcctttaaa aaagtctgtt 36660 
ggagcagttt tgttcttgaa ttttgctggt catcctcatg gtcccgagcc cccctactcc 36720 
gggtcgtgga ggcggccgag ggggaggctg ggggcccacg tggcccgtcc tggcggcacc 36780 
tgcagcactg ggggagccgc tgaaccccgt gcttcagcgc tgggggagcc gctgggcccc 36840
gtcttccgcc acaaaccatg catggccgcc acgtgagctc aaacgtccgt ttatttcaaa 36900 



gcagtaataa tttaaaatta taaaaatctt tccaccgctg aacgtttaga gggtgaggtt 36960
agacagagga cggggaggct ggggacgccc cagaggggac catgtggccc acgccttccc 37020
aagccagggg gccggtgggc cgggcccggg tcctgccctg gaacaggcgg gacctgcagc 37080
gctgaccagc caagcgtggc gccgccgggg cacccagtct gtgggtgccg tgtggcgctg 37140
gctgagggtg ggtgggaaag gccccgtgct ttcccgacgg ccgacgtggg ctcacgagtt 37200
gcttgtggcg ttctcgttgc tgggcgagct ggaggaggac gatgacgacg aggaggagaa 37260
gctcacccca gtgaggccag gggggttcgt ggccgtgttc tgtcccgtga ggctttttcg 37320
gcagacgggg cagctgtcgt gctttgtggg gacagaggca gggacgggag aaggggcagg 37380
ttagaggcgg gagggccgcg gtcggggtgg gggggcgggt gggcggggca ctcacctgct 37440
ccagccaggg cacgatgcag ccgtcgtgga acaggtggtt gcagggcagc tgccgcacac 37500
gctcacccag cgcgtagtcg tccttgcaca cagggcactc gagcccggag cctgcgggag 37560
tgtgcagctg cggtcacagc gggcgtgggg ggcctgccga gccttcaagg gcaggctact 37620
ccacagcctc agccggaggc cgcccctgag cccagcgagg ggagaaaagc cgtgtgtgtg 37680
tcccccgggc tgccagaggg gacctggaca gaaccctctc ctcccagccc accttcaggg 37740
aaatgctcga ggccgggtgc ggtggctcac gcctgtcatc ccagcacttt gggaggccga 37800
ggcaggagga tcacctgagg tcaggagttc gagacctgcc tgaccaacat ggtgaaaccc 37860
tgtctctact gaaaatacaa gtatgagcca ggcgtggcgg cgggtgcctg taattcccac 37920
tactcgggag gctgagctct catacctacg tgctcctcag tgacggggac ggtggggagg 37980
gcctggattt tctctttatc tgccggtggg gggcctgtgt tttcaaactg attgaggagc 38040
tgaaagacaa gaggcgagag tgccgggagc tcctcggggg cccggcccgg ggctctgaaa 38100
cgcgaggctg caggacctgc aaaagcaccg aggccgcgtt tgtcctgggc cctgggcccc 38160
ttggagcccg cccggggtcg gagatc 38186

<210> 39 
<211> 720 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 39 

cgccggcgct tgacctgact ttcatgaatc gaaaaggaaa tcctctatga acgcactgca 60 

tcgcatcggc gccggaacgc tactggccgt gttgctcgct tttggcctga ccggctgcgg 120 

ggagaaggag gaggttcagc agtcgctcga gccggtggct tttcacgact ctgacgagtg 180 

tcacgtgtgc ggcatgatca tcactgactt ccccggcccc aagggccagg cggtcgaaaa 240 

gcggggagtg aagaaatttt gttccaccgc cgaaatgctt ggttggtggc tgcagccgga 300 

aaaccgtctg ctcgatgcca agctctacgt ccacgacatg gggcgcagcg tttgggaaaa 360 

gccggatgac ggtcatctga tcgacgcaac cagcgcctac tatgtggtcg gtacgtcact 420 

caaaggcgcc atgggcgcgt cgcttgcaag ctttgccgag gagcaggacg ccaaggcgct 480 

tgccggcatg cacggcggtc gtgtgctgcg cttcgaggaa atcgatcagg cgctgctgca 540 

ggaggctgca agcatgcagc acggcggcat gcacgaccat gcgccaaacg gtgcacataa 600 

cgcacacgca ggccactgag cagcagtggt ctgaacagca cacacaagaa atcgaggtaa 660 

gcacaatgat gggtatcagc gtctggcaac tcctgatcat tcttctgatc gtcgtcatgc 720 



<210> 40 
<211> 127 
<212> DNA 
<213> Unknown 
<220> 

<223> Description of Unknown Organism: Unknown 
<220> 

<221> UNSURE 
<222> (9) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (101) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (119) 

<223> May be any nucleic acid 
<400> 40 

gcggccgcnc ggcgctggct gctgtgcgga ggccacggcg ggccgcgagc cgcctcgtcc 60 
tcgccctcct gccctgggtg cggccccccg ggtcccggcg ncccactcgc cccggcgtnc 120 
ccgcgct 127 



<210> 41 
<211> 6858 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 41 

actcgccaag tgatcgaccg gcccctgagg gccgcgacgc agagggcgcc ccgtgcactg 60 
gcacaggcgg ccttgtgcgt tagactctga tattcgtgcg ccctctcgtt ggcaggacca 120 
tccatcctgt gtgccggggg ccgcgcacac cgatcccgga tccgcctcgg ccctgccctg 180 
cgcgcccctc cgttctcgac ctccccgacg ctgtctgaac acgcgtcgcc gggggacgac 240 
TOcgggcggc ccgcctcggg ggaggggtaa gcgtcccggg atgcccgttc aaccgttccg 300 
caaggctcgc ccatcgtggg ggagaaccgg cgcgacgcta ggagagacaa gtgatccagc 360 
aggagtcgcg gctcaaggtc gccgacaaca ccggtgcgaa ggaaatcctg accatccgtg 420 
tgctcggcgg ttccggacgc cgctacgcag gcatcggcga caccatcgtc gccaccgtga 480 
aggacgccat ccccggcggc aacgtcaaga aggcgcacgt cgtcaaggcc gtggtggtcc 540 
gcacccgcaa gcagtcccgc cgtcccgacg gctcgtacat caagttcgac gagaacgcgg 600 
cggtcatcct gaagaccgac ggcgagcccc gtggcacgcg catcttcggc cccgtgggtc 660 
gcgagctgcg tgacaagaag ttcatgaaga tcgtgtcgct cgccccggag gtgatctgac 720 
ctcatggcca agatcaagaa ggacgacctc gtgcaggtca tcagtggcaa ggacaagggc 780 
aagcagggca aggtcctgcg cgtgttcccg acggatgagc gcgtgctcgt cgagggcgtg 840 
aaccgcgtga ccaagcacct gcgcgccggc caggacaaca acggttccac cgagggcggc 900 
ctgcaggtcg tcgaggcccc gatccacatc tcgaacgtgg ccgtggtgga cccggagacc 960 
aagaagccga cccgtgtggg ctaccgcttc gagaccgtcg agaaggacgg cgtgacgaag 1020 
accgtgaagg tccgcttcgc caaggcctcg gggaaggagc tgtgatgacc gaggtgcagc 1080 
agaccgagaa ggtcaccccg cgtctgaaga ccaagtaccg cgaggagatc cgcggacgcc 1140 
tgcaggagca gttccagtac gggaacgtca tgcaggtgcc gggcctcgtg aaggtcgtcg 1200 
tcaacatggg cgtcggcgag gccgccaagg actccaagat catcgacgac gccgtcaccg 1260 
acctcaccgc catcaccggc cagaagccga tgatcaccaa ggcccgcaag tccatcgcgc 1320 
agttcaagct gcgtgagggc atgcccatcg gcacgcacgc caccctccgt ggcgatcgca 1380 
tgtgggagtt cctggaccgc ctggtcacgc tgccgctgcc gcgcatccgt gacttccgcg 1440 
gcctgtccga ccgccagttc gacggcaacg gcaactacac cttcggcctg tccgagcaga 1500 
ccgtgttcca cgagatcgat caggacaaga tcgaccgcgt gcgcggcatg gacatcaccg 1560 
tggtgacgac cgccaagaac gacgacgagg gccgcgcgct gctcaaggcg ctgggcttcc 1620 
cgttcaagac cgaccagtaa gacctccacg ccacaggtcc tccaccggtg aaccggtggc 1680 
ggaaaccacg gcgagaaagg gcgtgaagca catgaccatg accgatcccg tcgcagacat 1740 
gctgacccgt ctgcgcaacg caaactcggc ctaccacgac accgtgtcca tgccgtcctc 1800 
gaagctgaag actcgcgtcg ccgagatcct caaggccgag ggctacatcc aggactggcg 1860 
cgaggaggag gccgaggtcg gcaagaagct gaccatcgac ctgaagttcg gcccgcagcg 1920 
tgagcgtgcg atcgccggcc tgcgccgcat ctccaagccg ggcctgcgcg tgtacgcgaa 1980 
gtccacgaac ctgccccacg tgctgggcgg cctcggcatc gccatcctgt ccacctcctc 2040 
tggtctcctc acgaaccagc aggccgccaa gaaggctggc gtgggcggag aagtcctcgc 2100 
ctacgtctgg tgacgggcaa gacggaagaa aggctgaact gacatgtctc gaatcggacg 2160 
tctcccgatc accatccccg ccggcgtcga tgtgaccatc gacggcgacc gcgtctccgt 2220 
gaagggcccc aagggcccca agggtcagct cgagcactcg ctgcccacgc ccatcacggc 2280 
caccctcgag gaggggcagg tcaccgtggc ccgccccgac gacgagcgtg agtcccgctc 2340 
cctgcacggt ctgacccgta ccctcatcag caacatggtc gagggcgtga ccaacggctt 2400 
ctccaagcag ctcgaggtcg tcggcaccgg ctaccgcgtg caggccaagg gccaggacct 2460 
cgagttcgac ctgggctact cccaccccgt cccggcgaag gtgtcccagg gcatcacctt 2520 
cacggtggag ggtaacaggg tcaccgtcgc cggtatcgac aagcagcagc aggtcggcga 2580 
gaccgccgcc aacatccgca agctgcgccg ccccgacccg tacaagggca agggcgtcta 2640 
cgcgggcgag cagatccgcc gcaaggccgg aaagaagtga tgtctactcr gaaggtgaag 2700 
ggcaagggca agttcaacgc ccgcacccgc cgccacctcc gggtgcgcaa gcggatctcc 2760 
ggcaccacgt ccgtcccccg cctcgtcgtc aaccgctctg cacggcacat gttcgtgcag 2820 
gtcgtggacg acacgcagag ccgcacgatc gcgtacgcct ccaccatgga ggccgacgtg 2880 
cgtgcgctcg agggtgacaa gacggccaag gccaagcgcg tgggcgagct cgtcgccgag 2940 
cgtgccaagg cggccggcat cgaggccgcg gtcttcgacc gggcgggcaa caagtaccac 3000 
gggcgcgtcg cggccgtggc cgacggtgcg cgagagggtg ggctgcagct gtgaccgaga 3060 
acatcaacca gaaggacact caggtgaccg agagcaccga gaccaccgtc tccgagaccg 3120 
ggtcgggctc gcgagccaga ccaccgagcg cgccaccggt ggccgcggcg gtcgcgacgg 3180 
cggccgcggt ggccggacgg cgatcgtcgt ggcggccgtc ggacgaccga accgtcgtgg 3240 
cgcccaggac gacgaggaag gaccagttcc tcgagcgcgt cgtgggcatc aaccgcgtct 3300 
ccaaggtcgg ccgccgcttc tccttcaccg ccctcgtggt ggtgggtgac ggcgacggca 3360 
ccgtcggcgt cggctacggc aaggcgaagg aggtccccgc tgcgatccag aaggccgtgg 3420 
aggaggccaa gaagtccttc ttccgcgtcc cccgcgtcgg ctccaccatc ccgcacctgg 3480 
tgcagggtga ggacgccgcc ggcgtcgtgc tgctccgccc ggcctccccg ggtaccgcgg 3540 
tgatcgccgg cggtccggtg cgcgccgtgc tcgagtgcgc cggcatccac gacgtgctct 3600 
ccaagtccat gggctccgtg aacgcgatca acatcgtgcg cggcacggtg gagggcctca 3660 
agaagctgaa gagcccccag gccgtcgccg cccgccgcgg caaggccctg gacgagatcg 3720 
gcccccatgc gatgctgcgc accatggaga acgatcgcgc ccagaagagc gcgaaggcag 3780 
gtgcgtgacg cgtgtttgag tccactcgca agaacatcca gccctcggac gccaccctgg 3840 
tcatcaccca gacccgcggc gtcacgggct ccaagcagaa ccatcgggac accctgcgct 3900 
cgctgggcct gaagcggatc ggccaccagg tcacccgcaa ggccgacgcg gtgacggtcg 3960 
gcatggtcaa caccgtgccg cacctggtgt ccgtggagga ggtcaacaat ggctgacaac 4020 
gacgccatca aggtccacga cctgcgtccg gcccccggtg ccaagaccgc caagacccgc 4080 
gtgggtcgcg gtgaggcgtc gaagggcaag accgccggtc gcggcaccaa gggcaccaag 4140 
gcccgttacc aggtccgtgc gggcttcgag ggcggtcagc tgcccctgca gatgcgtctg 4200 
ccgaagctcc gcggcttcaa gaacccgttc cgcacggagt accaggtcgt gaacctggac 4260 
aagctctccg cgcacttccc cgagggcggt gaggtcaccg tggacgcgct cgtctccaag 4320 
ggcctcgtcc gtcgtggcca gcccgtgaag gtgctgggca cgggggagat caccgcggcc 4380 
gtgcaggtga aggcgaacgc cttctctgcg tccgccgtgg agaagatcca ggccgccggc 4440 
gggtccaccg agaccctctg acacgccgac ccatcgaccg agggccctgg ccggagcagc 4500 
cgctcgggcc aggccctggt ccgtccgtgt agactcgcac agccgccccg gtgtggccgc 4560 
cgtctcgtgc ccccgccccg cggaacggcg cacgccccac aggaccagcc gcaggaggac 4620 
tcgtgctcaa ggccatcgcc cggatcgtcc ggacgcctga cctgttgcgg aagatcgcct 4680 
tcacgctcgg gctcatcgcc gtctatcgga tgggcgactt cgtgccggcc accggcgtgg 4740 
actacccggc ggtgcagcag tgcctggcag cgggcaacgc ccagggcggc ctgtactcct 4800 
tcgtgaacat gttctcgggc ggggcgctcc tgcaggtgtc tgtcttcgcg ctgggcatca 4860 
tgccgtacat cacggcgtcg atcatcgtgc agctgctgcg cgtggtgatc ccgcgcttcg 4920 
agcagctcca ccaggagcgc cgcaggggcc aggcgacgct gacgcagtac acccgctacc 4980 
tgaccctcgc cctcgccctg ctgcaggcga ccacgatggc ctcgctggcc cgcaccgggg 5040 
ccctgctcgg atgcagcctg ccgctgctgc gcgacggctc catcctcacg gtgctgctcg 5100 
tggtcatcgc cctgaccacc ggctgtctca tcgtcatgtg gttcggggag cggatcaccg 5160 
agaacggcgt gggcaacggc atgtccctgc tcatcttcac ctccatcgcg gcaggcttcc 5220 
cggccggtct cggccaggtg gtccagacgc agggctggcg cgtgttcgcg atcgtcatgg 5280 
ggatcggcct gctcaccatg ctggccatcg tcttcgtgga ggagtcgcag cgccggatcc 5340 
cggtccagta cgccaagcgg cagatcggct cacggaccgt gggcgggtcg agcacctaca 5400 
tcccggtcaa ggtgaacatg gccaacgtca tcccggtcat cttcgcctcc tccgtgctga 5460 
tgctcccggg catcctcatc cagttcaaca cgccgcagga cggcagtgcg ccggccccgt 5520 
ggatcacgtg gctgagccgg tacttcggct ccggtgacca cccggtgtac atggccctgt 5580 
acttcctgct catcatcggc ttcacgtact tctacgtgtc catcacgttc aacccggtgg 5640 
agatctcgga caacatgaag cgctacggcg gcttcatccc ggcgtccgcg ccggccggcc 5700 
ccaccgagcg ttacctgcag tacgtcatca gccgcatcac gttcgtggtg ggggccctct 5760 
acctcggtat cgtggccatg atcccgctga tcgccttcgc ggtgatcggc accagccaga 5820 
acttcccgct cggcggcacg tccatcctca tcatggtggg cgtcggcctc cagaccgtga 5880 
agcaggtcag cgcacagatg gagcagcgcc actacgaggg cctgctgcgc tgagccccga 5940 
cccgatcccg caacgccgtc cgtatcgaca gtgaggaaca cacgatgacc cgcatgctgc 6000 
tcatgggccg tcccggttcc ggcaagggca cccaggccac ccggatcgcc gacaagctgg 6060 
ggatcgtccc gatctccacc ggtgacatct tccgccacaa cgtgaagtcg atgacgccgc 6120 
tcggcgtcga ggccaagagg tacatcgaca acggcgactt cgtccccgat gaggtcacga 6180 
accgcatggt cgccgaccgc atcgcccagg ccgacgcgga gcacggcttc ctgctggacg 6240 
gctacccgcg cacgaagggc caggtcgagg cgctggacgc catgctcgcc gaggccggcc 6300 
agtcgctgtc cgccgtcgtc gagctggagg tgcccgacga ggagctcgtg gagcgcctgc 6360 
tcaagcgtgc cgagatcgag ggccgcgcgg acgacaccca ggaggtcatc gagcaccgcc 6420 
tggacctgta ccaccgcgag accgagtccg tcatccagga gtacgtggag cgcggcatcg 6480 
tcgcccgcgt ggacggcacc ggccagatcg acgacgtcac cgagcgcctg ctgcaggccg 6540 
tgtactccgt gcgctccgcc acgggctccc tgcccgtgat ccagccgggc gcggagtcct 6600 
gaccccgtga tcggccgccg ctcgctcgag ctcaagaccg ccccccagct gctggccatg 6660 
cagcgcgcgg gggtggtcct gtccgaggca ctggacgccg cgctggccgg cgcgccgggc 6720 
ttcaccaccg cggagctgga cgccgtgttc gcggtggtgc tggccgaacg cggtgcgacc 6780 
tccaacttcc tgggctacta cgacttcccg gcctcgatct gcacctcggt caacgaggag 6840 
gtcgtgcacg gcatcccc 6858 



<210> 42 

<211> 578 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> UNSURE 
<222> (5) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (23) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (31) 

<223> May be any nucleic acid 



<220> 

<221> UNSURE 



<222> (48) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (211) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (292) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (308) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (350) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (384) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (477) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (507) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (529) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (549) 
<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (551) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (558) 

<223> May be any nucleic acid 
<400> 42 

ttctngtcta tggcagagat ggncaggttg ncgttgagca ggtactgncc atcagccgtc 60 
ttcagcgcca ggtagttccc atcgttctgc acacccgggt ggctccgctg cttcacgtca 120 
atattagtgg caccagctgg gatggtgaca atgtcattgt agccataatt ggtgggggtg 180 
agggacccgg agaccttcct gcaggagttg nctttgcccc cacacacccc gcatttgtcc 240 
agcttccgag gcgagtccac cacatggtca cagccggcct tgacacactg gncacggaca 300 
cagatggnca gtgtttctgg cccacacagg gtgccatcaa tcaccttggn ctcgaacact 360 
ttggaactcg ctcctccccc gggntcggga ggaacaactt gcaggggtcc cgggggggac 420 
aacccagcat tcttggggga cccactgcag gaggattccc cgtccatgtc aagtgtnatt 480 
ggtgggcatt attcttctca caattgntgc tccctgaagg ttttcccgnc aaggggggat 540 
tcccccccng ntggaatnat tggtacttgg gtctccga 578 

<210> 43 

<211> 305 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> (128) 
<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (146) 

<223> May be any nucleic acid 
<400> 43 

catttaagtt tgctagtcct ttgcaaacag actgacgctg agtgtcctgt ctgagtcaat 60 
aagtgcactt ttacctttta acctatgccc tctacttgaa cccgagcaag gtccagtcca 120 
ctggacangt tgatgatagg gtctgncgcc ccataccctc tcctcttccc ccttaggaat 180 
ttgtgcagta ctggaggggt tgcggcaatg ggaggcctgg gtgggccgtg ctgccttgat 240 
atggccaagg gacccagtca ccacagtgga gacccttgtc tgcacctcag taccgcatgt 300 
ccagg 305 



<210> 44 
<211> 333 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> (82) 

<223> May be any nucleic acid 



<220> 

<221> UNSURE 
<222> (255) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (275) 

<223> May be any nucleic acid 



<220> 

<221> UNSURE 
<222> (299) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (313) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (324) 

<223> May be any nucleic acid 



<400> 44 

ggcacaggtg actttagcat gcagagcagc aaagagagag caaccaccaa catcatccag 60 
ccgctgctcc acgcacagtg gntgctgggg gactggtctg agtgctctag cactgcgggg 120 
ccggctggca gaggcgaact gtagagtgca gggacccctc cggtgcaggc ctctgccacc 180 
tgcaacaagg ctctggaaac ccgaggatgc caagccctgg cagaaccagc tgtgccccct 240 
gtgatttcag ggggncaggg gccattttgt gctcngggac atgcggtaat ggaggttgnc 300 
agacaaggtc ttncattgtg gtgnatgggt tcc 333 



<210> 45 
<211> 102 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 

<220> 

<221> UNSURE 
<222> (64) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (69) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (71) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (72) 

<223> May be any nucleic acid 
<400> 45 

gcagcagcag cgcagcgcag agagagcagc agcagcagca gcagcagcag cagagcagat 60 
cntnctggna nnaaaaaatc gcggcagcag ctgctctagc ag 102 



<210> 46 
<211> 123 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<220> 

<221> UNSURE 
<222> (9) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (51) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (52) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (57) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (67) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (123) 

<223> May be any nucleic acid 
<400> 46 

caggcaagnc ggcacgtagg agcagcagca gcagcagcag cagcagtaac nnagtcnacg 60 
agggggngcc cgggacccaa ggcgcccgaa cagagaggcg gagcacaatc cactggtcgg 120 
cgn 123 



<210> 47 
<211> 109 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<220> 

<221> UNSURE 
<222> (87) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (95) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (102) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (106) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (107) 

<223> May be any nucleic acid 
<400> 47 

ggcacgcagg agcagcagca gcagcagcag cagcagcagc agagagagag cagcagagag 60 
agagagcagc agagcagagc agagcanagt agagnagagc anagcnnac 109 

<210> 48 

<211> 293 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> UNSURE 
<222> (86) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (166) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (185) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (209) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (214) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (219) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (234) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (290) 

<223> May be any nucleic acid 



<400> 48 

ggcacgaggg ggaaactgct ccgcgcgcgc cggggaggag gaaccgcccg gtcctttagg 60 
gtccgggccc ggccgggcat ggattnaatg cctgagcccg ggtcccgctg tcttctgctt 120 
cttcccttgc tgctgctgct gctgctgctg ctgccggccc cggagntggg cccgagccag 180 
gccgnagctg aggagaacga cttgggttng cctncccana aaatgggaag gganttgggg 240 
ttaatcgaag tcattgggac cattttaaaa ggggcttcct ggattatagn ctt 293 



<210> 49 

<211> 506 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> UNSURE 
<222> (283) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (342) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (356) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (362) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (364) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (368) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (429) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (454) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (461) 

<223> May be any nucleic acid 
<400> 49 

aattcggcac gagcacccgg ccactgcagt cttctgccct gctggacagc agcagcagca 60 
gcagcagcag cagcagcagc agcagcaaca gtaacagcag cagttcgtcc ggacccaacc 120 
cttctacctc ctttgagccg atcaaggcag accccacagg tgttttggaa ctccccaaag 180 
agctgtcaga aatctttgat cccacacgag agtgcatgag ctcggagctg ctggaggagt 240 
tgatgtcctc agaagtgttt gcccctctgc tttcgtcttt ctncaccccc gggagaccac 300 
gattatatct acaacctgga cgagagtgaa ggtgtttgtg anctcttttg atgtgnctgt 360 
tntnaacntt tgactgacag ggacatgcct tttttggttg ggacccagat tttttgactt 420 
gggggtttnc ttgggacttt tcaaccgacc ctanagagtt nagagcaaan aggttggttt 480 
ttcggcttcc ttaacgaaag ttttgg 506 



<210> 50 

<211> 419 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> (137) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (221) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (259) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (327) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (385) 
<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (389) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (416) 

<223> May be any nucleic acid 



<220> 

<221> UNSURE 
<222> (418) 

<223> May be any nucleic acid 



<400> 50 

tttaagcacc aaaacttgtg ttttaatgat gttggatgga aatctttcct aaatgtgtca 60 
tgcatgctct tgtctccctt aatggagaga gtgtgacact gcttagcact tggatggctt 120 
ggggtggtgg ttatgancag cagtctgtca cagctcagcg aggtgaagcc tgtgggcgtt 180 
ttgctctgtg ctgaatggct cagtggccct acaaagcgga ntcagctctt ggtggctttc 240 
tgttgtggtg ggctgctgnt gctgctgctg ctgctgctgc tgctgccctt gcctctaaaa 300 
gaactcactt cctcttcctc ctgctgncac ctgtcttttg gcttgtggga ttggagtcat 360 
ggggcccaga tggagccttg ctccntgant tatgataggc ccctcggtct cttttntnc 419 



<210> 51 
<211> 495 
<212> DNA 

<213> Saccharomyces cerevisiae 



<220> 

<221> UNSURE 



<222> (177) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (322) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (328) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (342) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (368) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (371) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (375) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (380) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (386) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (396) 



<223> May be any nucleic acid 



<220> 

<221> UNSURE 
<222> (404) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (423) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (426) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (436) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (443) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (456) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (460) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (467) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (468) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (471) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (474) 

<223> May be any nucleic acid 
<400> 51 

aattcggcac gagcaaagtt ctgcgctcca ttgtgggcat caaacgacac gtcaaagccc 60 
tccatctggg ggacacagtg gactctgatc agttcaagcg ggaggaggat ttctactaca 120 
cagaggtgca gctgaaggag gaatctgctg ctgctgctgc tgctgctgcc gcagacnccc 180 
agtccctggg actcccacct ccgagccagc tcccaccccc agcatgactg gcctgcctct 240 
gtctgctctt ccaccacctc ttgcacaaag cccagtcctc cggcccagaa catcctgggc 300 
ccggagttcc ttccttgcct tnaggggntt ttcagcaagt tnagttcctt gggtcctttt 360 
tgggaaantt naggnagttn aaggantacc aggttnttgc catnctttcc agatccaagt 420 
ttnacnaaaa attttnaaca gtntaaattg ggtttnttgn ccctttnngg nggntgtttt 480 
ttttttcggg tccgg 495 



<210> 52 
<211> 81 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 



<220> 

<221> UNSURE 



<222> (65) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (67) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (71) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (75) 

<223> May be any nucleic acid 
<400> 52 

ggcacgcagg agcagagcag cagcagcaga gagagcagca gcagcagcag cagcagcaga 60 
gagananata natanatata t 81 



<210> 53 

<211> 305 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> (11) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (62) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (81) 

<223> May be any nucleic acid 






<220> 

<221> UNSURE 
<222> (256) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (289) 

<223> May be any nucleic acid 



<400> 53 

aggcacttga nttgaaaatg gaaaacccta ctgctggtgg tgctgcggtg atgaggccta 60 
tnatgcagcc ccagggtttt nttaatgctc aaatggtcgc ccaacgcagc agagagctgc 120 
taagtcatca cttccgacaa cagagggtgg ctataatgat gcagcagcag cagcagcagc 180 
aacagcagca gcagcagcag cagcagcagc aacagcaaca gcaacagcaa cagcagcaac 240 
agcagcaaac ccaggncttc agcccacctc ctaatgtgac tgcttcccnc agcatggatg 300 
ggctt 305 



<210> 54 
<211> 307 
<212> DNA 

<213> Hepatitis C virus 



<220> 

<221> UNSURE 
<222> (212) 

<223> May be any nucleic acid 
<400> 54 

tggggtgtga agctccggtg ctggtgcggc gggggactgc ggggccagcc tcagtttaaa 60 
ccccctcagc agtctttctg tcgttgccct ccacactgcg agactctgga gggcgatctg 120 
gaggtctgga agataaccga ttcctgggag atttgggggt agtctccaat ctgtccctgg 180 
ctcatcttgt gacccgaagc cggcggcctt gncaggagta ttctagaatg agtgcacata 240 
aaaatacctt caaacggtag cagcagcagc agcagcagca gcagcaagca gcagcagcag 300 
cagcagc 307 



<210> 55 
<211> 88 
<212> DNA 
<213> Unknown 

<220> 

<221> UNSURE 
<222> (6) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (7) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (78) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (83) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (87) 

<223> May be any nucleic acid 



<220> 

<223> Description of Unknown Organism: Unknown 
<400> 55 

ggacanngac tactctctct ctctctctct ctctctctgc tgctgctgct gtgctgctgc 60 
tgctgctgct gctgccgntg tgngcana 88 



<210> 56 
<211> 346 
<212> DNA 
<213> Unknown 



<220> 

<221> UNSURE 
<222> (278) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (288) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (299) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (313) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (342) 

<223> May be any nucleic acid 



<220> 

<223> Description of Unknown Organism: Unknown 
<400> 56 

ggcacagccc aactggtgat gctgctgctg ctgctgctgc tgccgccgcc gcctctattg 60 
ctgatactct agtggggctg gaagggtggt tcctattcgc accatcgcca accagagaca 120 
gagggaaaaa aaaaaccggc agccactgct gaatgttggg ttcggaggct gcatccgact 180 
cggtcacaag gaaaatggat tcagtttgca tctctccctc ctttaaacag cttctccggg 240 
tctcagcatg ggcttccagg gcagcgattg aggagacntt accaaggngc accacacant 300 
agatgctgag acntcgtgac tccaggataa gaaacattaa cngggg 346 



<210> 57 
<211> 496 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<220> 

<221> UNSURE 
<222> (11) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (78) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (195) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (197) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (286) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (291) 

<223> May be any nucleic acid 



<220> 

<221> UNSURE 
<222> (293) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (315) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (328) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (329) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (344) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (346) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (352) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (354) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (358) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 



<222> (366) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (399) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (406) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (410) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (418) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (420) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (435) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (443) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (453) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (454) 
<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (459) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (471) 

<223> May be any nucleic acid 



<220> 

<221> UNSURE 
<222> (473) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (474) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (481) 

<223> May be any nucleic acid 
<400> 57 

gaattcggca naggtgcaca gatgtggtgg atggggaggg ccgcacggga cagaagttct 60 
ccctgtgtat tctgacgnct gagaaaggag catttcatcc gggcggagac caaggagatc 120 
gtcaatgggt ggctggagat gctcatggtc tatccccgga ccaacaagca gaatcagaag 180 
aagaaacgga aagtngnagc cccccacacc acaggagcct gggactgcca agttgggctg 240 
ttaccagcag cagcagcagc agcagcagca gcagcagcat ccccantgct ntnggaaagt 300 
tcccaccacc aagtnccaca atttgggnna aaaccaaggt tgtngnagac gngntttngg 360 
gatttnggca ttgtgggttg cttgcatgga aggacattng gttgtnggtn ccttggangn 420 
tacaattacc atttncggtt gtnaaggtta aanntccgnc attcagaagg ntnnaaggtg 480 
ntttgaagtc catttg 496 

<210> 58 
<211> 268 
<212> DNA 

<213> Drosophila sp. 
<220> 

<221> UNSURE 
<222> (16) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (51) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (60) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (202) 

<223> May be any nucleic acid 
<400> 58 

aacacttatc cttganagct ctgtttggga agcaggacaa agctacatgt naggaaactn 60 
tggagcctcc gcagactctc caccagcagc agcagcagca gcagcagcag caagagaagc 120 
ttccaattag gcagggggtt gtacgctccc tgtcctatga ggaacccaga agacactcac 180 
cccccattga gaagcagctc tntccagcca ttcagaaact catggtcagg agcgcagacc 240 
tccacccatt gtcagagctg cctgaaaa 268 



<210> 59 
<211> 471 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> (249) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (386) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (449) 

<223> May be any nucleic acid 
<400> 59 

tcgacccacg cgtccgctga ggaacagacg ttccctggcg gccctggcgc cttcaaaccc 60 
agacatgctg ctgctgctgc tgctgctgcc cctgctctgg gggacaaagg ggatggaggg 120 
agacagacaa tatggggatg gttacttgct gcaagtgcag gagctggtga cggtgcagga 180 
gggcctgtgt gtccatgtgc cctgctcctt ctcctacccc caggatggct ggactgactc 240 
tgacccagnt catggctact ggttccgggc aggagacaga ccataccaag acgctccagt 300 
ggccacaaac aacccagaca gagaagtgca ggcagagacc cagggccgat tccaactcct 360 
tggggacatt tggagcaacg actgcnccct gagcatcaga gacgccagga agagggataa 420 
ggggtcatat ttctttcggc tagagagang aagcatgaaa tggagttaca a 471 

<210> 60 
<211> 379 
<212> DNA 
<213> Unknown 
<220> 

<223> Description of Unknown Organism: Unknown 
<220> 

<221> UNSURE 
<222> (2) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (14) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (31) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (135) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (315) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (332) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (349) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (357) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (374) 

<223> May be any nucleic acid 



<400> 60 

anttcggcan aggnaaggga gagggtgacc ngcatcccaa ctagatttca gtggagtgaa 60 
gttcaggagg catggagctg acaaccatga ggcctcggca gccaccgcca ccaccgccgc 120 
cgccaccacc gtagncagca gcagcagcag cagcagcagc aagagttaac tctgacttag 180 
ggaatagaga cagccagaga gaaatgtgat caatgaagga gacatctgga gtgtgcgtgc 240 
ttcttcagag gggacgggtg atgggcagat ttggaaaaag caccgcagat tgggaacctt 300 
atcttttctt tttcntaaaa ttgttgttat gnaaatttgg gtttttccng taacttntta 360 
aaaacttaaa agtnggttt 379 



<210> 61 
<211> 255 
<212> DNA 
<213> Unknown 



<220> 

<223> Description of Unknown Organism: Unknown 
<220> 

<221> UNSURE 
<222> (121) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (183) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (254) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (255) 

<223> May be any nucleic acid 
<400> 61 

aattccgaca atggaaagca ctcttagcct tgcagtggtc tacattttta aggaaccaat 60 
atttcagcat tctttattac ccggcacgct gtgtcctttg tcagagttca agtttatggt 120 
nactgccagg gtcagacagt ccatttgctg ctgctgctgc tgctgctgct ttctcgaact 180 
ggnatggcat tagggaagct gctgtctgag tgttagggaa tgtcttggct aagtaaagcc 240 
aatgttcttt cctnn 255 

<210> 62 
<211> 5289 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 62 

cgagctctcc cagccgcagc ctccgaatcc acggcctcca ccccgcgcct ctccagcgct 60 

ctatcccgtc gctgcgccct tgtcgccggc cccggccgct gcatccgcgt ccgcacaggc 120 

tccttgctgg gcacaaatag ctccaccatg gggctggcct ggggactcgg tgtcctgctc 180 

ctgttgcatg cctgcggctc caaccgcatt ccagagtctg ggggagacaa cagtgrgttt 240 

gacatctttg aactcaccgg agctgcccgc aagcggtctg ggcgccgact ggtgaagggc 300 

cctgaccctt ctagcccagc tttccgcatc gaggatgcca acctgatccc ccctgtgcct 360 

gacaagaagt tccaagacct agtggatgct gtgcgggcgg agaaaggttc cctcctcctg 420 

gcctccctga ggcaaatgaa gaagacccgg ggtaccctgc tggctgrgga gcggaaagac 480 
cactctggcc aggtcttcag cgtgatctcc aatggcaagg cgggcaccct ggaccrgagc 540 
ctgaccgtgc aggggaagca gcatgtggtg tcggtggaag aagcactcct ggcgactggc 600 
cagtggaaga gcatcaccct gtttgtgcag gaggacaggg cccagctgta catcgactgt 660 
gagaagatgg agaatgcgga gctggatgtc cccatccaga gcatcttcac cagggacctg 720 
gccagcatcg ccaggctccg cattgccaaa ggaggtgtca acgacaattt ccagggggtg 780 
ctgcagaatg taaggtttgt ctttggaacc acaccagaag acatcctcag gaacaaaggc 840 
tgctccagct ctaccagtgt ctttgtcacc cttgacaaca acgtggtgaa tgggtccagc 900 
cctgccatcc gcaccgacta cattggccac aagacaaagg acctgcaagc catctgtggc 960 
atctcatgtg acgagctgtc cagcatggtc ctggagctca ggggtctacg caccatcgtg 1020 
accacgctgc aggacagtat ccgcaaagtg accgaagaga acaaagagct ggccaacgag 1080 
ctgaggaggc ccccactctg ctaccacaac ggagtgcagt acaggactgg cgacgagtgg 1140 
acggtggaca gctgcactga gtgtcgctgc cagaactcag ttaccatctg caaaaaagtg 1200 
tcctgtccca tcatgccctg ctccaatgcc acagttccgg atggagaatg ctgcccacgg 1260 
tgctggccca gcgactctgc agacgatggc tggtccccgt ggtctgagtg gacctcttgc 1320 
tctgtgacct gtggcaatgg aatccagcag cgtggccgct cctgcgacag cctcaacaac 1380 
agatgcgagg gctcctctgt gcagacgcgg acctgccaca tccaggagtg tgacaagaga 1440 
tttaaacagg atggcggctg gagccactgg tccccatggt catcttgctc cgtaacatgt 1500 
ggagacggtg tgatcacaag gatccggctc tgcaactccc ccagccccca gatgaatggg 1560 
aagccatgtg agggcaaagc ccgggagacc aaagcctgcc agaaagactc ctgccccatc 1620 
aatggaggct ggggaccttg gtcaccatgg gacatctgtt ctgtcacctg tggaggaggg 1680 
gtacagaaac gtagccggct ctgcaacaac cccaaacccc agtttggagg caaggactgc 1740 
gttggtgatg tgacagaaaa ccagatctgc aacaagcagg actgtcccat tgacggatgc 1800 
ctgtccaatc cctgctttgc tggtgtccag tgtaccagct accctgatgg cagctggaag 1860 
tgtggtgcct gtcccccagg ctatagtgga gatggagtcg agtgcaaaga cgttgatgag 1920 
tgcaaagaag tccctgatgc ctgcttcaac cacaatggag agcacaggtg tgagaacaca 1980 
gaccccggct acaactgcct gccctgccca ccgcgcttca ctggctcgca gccctttggc 2040 
cggggcgtgg aacatgccac cgccaacaag caggtatgca agccccgaaa cccctgcaca 2100 
gacgggacac acgactgcaa caagaacgcc aagtgcaact acctgggcca ctacagcgac 2160 
Gccatgtacc gctgcgagtg caagcctggc tacgccggca acggcatcat ctgcggggag 2220 
gacacagacc tggacggctg gcccaatgag gacctgctgt gcgtggccaa cgcaacttac 2280 
cactgcagaa aggataattg ccccaacctt cccaactcag ggcaggaaga ctatgacaag 2340 
gatggaatcg gcgatgcctg cgatgatgac gatgacaatg ataagattcc agatgacagg 2400 
gacaactgtc cattccatta caacccagcc cagtacgact atgacagaga tgacgtggga 2460 
gaccgctgtg acaactgccc ctacaaccac aacccagacc aggctgacac agataacaat 2520 
ggggaaggag acgcctgtgc agctgacatt gatggggaca gtatccucaa tgaacgggac 2580 
aactgccagt atgtctacaa tgtggaccag aaagacactg acatggacgg ggttggtgat 2640 
cagtgtgaca actgccccct ggaacacaat ccagaccagc tcgactctga ctcggaccgc 2700 
attggagaca cctgtgacaa caatcaggat attgatgaag acggccacca gaacaatctg 2760 
gacaactgtc cctacgtgcc caacgccaac caggctgacc atgacaagga tggcaaaggc 2820 
gatgcctgtg accatgatga cgacaatgat ggcattcctg atgaccggga caactgcagg 2880 
ctggtgccca atcctgacca gaaggactct gatggtgatg gtcgaggtga tgcttgcaaa 2940 
gatgattttg accaggacaa ggtgccagac attgatgaca tctgtcccga aaatgttgat 3000 
atcagtgaga ctgatttccg ccgattccag atgattcctc tagatcccaa agggacatcc 3060 
cagaatgacc ctaactgggt tgtacgccat cagggtaaag aactcgtcca gactgtcaac 3120 
tgtgaccctg gacttgctgt aggttatgac gaatttaacg ccgtggactt cagtggcacc 3180 
ttcttcatca acaccgagag ggatgacgac tatgccggct ttgtgtttgg ctaccagtcc 3240 
agcagccgct tctatgttgt gatgtggaag caagtcactc agtcctactg ggacaccaac 3300 
cccacgaggg ctcaggggta ctctggactt tccgtgaagg ttgtaaactc caccacgggg 3360 
cctggcgagc acctgcggaa tgccctgtgg cacacaggaa acacctcrgg ccaggtgcgc 3420 
acactgtggc atgaccctcg tcacattggc tggaaagatt tcactgccta cagaiiggcat 3480 
ctgagccaca ggccaaagac aggtttcatc agagtggtaa tgtatgaagg gaagaaaatc 3540 
atggctgact caggacccat ctatgacaaa acctatgctg gtgggaggct aggcttgttc 3600 
gtcttctctc aagaaatggt gttcttctcc gacctgaaat atgaatgcag agactcctaa 3660 
tcatcaaact gttgatcaaa agactgatca taaaccaatg ctggtattgc acctiictgga 3720 
accatgggct tagaaaaccc ccaggatcgc gcctcgctgc ctgcctttgc tctctgcttg 3780 
catgagtgtg gactcctaga acatgtgact tgcctcaaga aaatgcaatt ttccaaatca 3840 
gaccctgcat tcagcctctg actgagaaga atcttccaag gagacaaaca atgactttgg 3900 
ttggcttttg caaaagcaaa agcatccaca tgctttggtt ggaaggtgcc tgtcccactc 3960 
tgcttttgtc agagcagaat gcgactgtga ggccagctct gagcagtgga ctccaaaatg 4020 
ttttcaggca tgtgagagaa gggaggactc actagaattg acaaacaaaa ccagccctga 4080 
cctactccct ctggaatggg ggcgggtggg ggggccaaag cccaaagggg aggatgcata 4140 
cccaagagat gattgtatga agaaaatatg gaggaactgt tacatttttg gtactaaatc 4200 
attttcaggg gattgaaaga ctattgctgg atttcatgat gctgaccggt gttagctgat 4260 
taacccacat aaataggcac ttaaatagga gcagggaagg aaggaaaaga ctggcttctg 4320 
gacttcctcc cagatttcca ccccttaaca catcacctgt agtgaccaga acagggagtc 4380 
ggagttaaac cgacacaagg cagggccagc tgctgcagct tggttctatt gaaattgtca 4440 
gttgtattcc agatgtagct tctgcagatg tagcagcaaa ataagaatac ccaccatctc 4500 
agcgagcacc aggctgtctc ccaagggacg gcagccatgc ttgtattttt atggttagaa 4560 
aggcacaaaa ttatcaacta agacattcct tctttctctt ttttrcctga acatcatgga 4620 
gttttccagt tgtctctttt ggactgtagt ttttagtgtt ttaaacaaac actttacaat 4680 
gtaaactatt tattttttac ttartctggg ggatctgtct gaaagactat tcatggaaca 4740 
ggaagaagcg taaggactat ccatatcatc tttgctacaa gtcattatga ctgtaagatt 4800 
gtaaatacag attatttatt aactctgttc tacctggaat ctagtttcat atggaaagtg 4860 
tttgagagca ggtagttgag atcgatcagc aaatctttca caggaatggc acaaggaaac 4920 
cagcatagca agctgctctt caccttgtgc ttagactgga tgatttggaa ttcttttttc 4980 
cttttttttc ccaagtggaa ttacttggtt gtccatttgc aagtgttttt agtttgcaaa 5040 
gaaagccaag aggccattaa tactgtctta tcccatccct tgtgcctatt tccagggaga 5100 
tgaaaagcat cracatttat tatttttgcc tttttccaaa agaaaaaaat gacaaaggtg 5160 
aaacttgtat acaaatatta cctcatttgt tgtgtgactg agtaaagaat tttgggatca 5220 
aacagaaaga gtttaagtgt craacaaact taaagctact gtagtaccta aaaaaaaaaa 5280 
aaaaaaaaa 5289 

<210> 63 
<211> 2053 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 63 

gaattccggc ggccgctgag agcccaccct ggcgagctct cccagccgca gcctccgaat 60 
ccacggcctc caccccgcgc ctctccagcg ctctatcccg tcgctgcgcc cttgtcgccg 120 
gccccggcgc tgcatccgcg tccgcacagg ctccttgact gggcacaaat agctccacca 180 
tggggctggc ctggggactc ggtgtcctgc tcctgttgca tgcctgcggc tccaaccgca 240 
ttccagagtc tgggggagac aacagtgtgt ttgacatctt tgaactcacc ggagctgccc 300 
gcaacggtac tgggcgccga ctggtgaagg gccctgaccc ttctagccca gctttccgca 360 
tcgaggatgc caacctgatc ccccctgtgc ctgacaagaa gttccaagac ctagtggatg 420 
ctgtgcgggc ggagaaaggt ttcctcctcc tggcctccct gaggcaaatg aagaagaccc 480 
ggggtaccct gctggctgtg gagcggaaag accactctgg ccaggtcttc agcgtgatct 540 
ccaatggcaa ggcgggcacc ctggacctga gcctgaccgt gcaggggaag cagcatgtgg 600 
tgtcggtgga agaagcactc ctggcgactg gccagtggaa gagcatcacc ctgtttgtgc 660 
aggaggacag ggcccagctg tacatcgact gtgagaagat ggagaatgcg gagctggatg 720 
tccccatcca gagcatcttc accagggacc tggccagcat cgccaggctc cgcattgcca 780 
aaggaggtgt caacgacaat ttccaggggg tcctgcagaa tgtaaggttt gtctttggaa 840 
ccacaccaga agacatcctc aggaacaaag gctgctccag ctctaccagt gtctttgtca 900 
CGcttgacaa caacgtggtg aatgggtcca gccctgccat ccgcaccgac tacattggcc 960 
acaagacaaa ggacctgcaa gccatctgtg gcatctcatg tgacgagctg tccagcatgg 1020 
tcctggagct caggggtcta cgcaccatcg tgaccacgct gcaggacagt atccgcaaag 1080 
tgaccgaaga gaacaaagag ctggccaacg agctgaggag gcccccactc tgctaccaca 1140 
acggagtgca gtacaggact ggcgacgagt ggacggtgga cagctgcact gagtgtcgct 1200 
gccagaactc agttaccatc tgcaaaaaag tgtcctgtcc catcatgccc tgctccaatg 1260 
ccacagttcc ggatggagaa tgctgcccac ggtgctggcc cagcgactct gcagacgacg 1320 
gctggtcccc gtggtctgag tggacctctt gctctgtgac ctgtggcaat ggaatccagc 1380 
agctggccgc tcctgcgaca gcctcaacaa cagatgcgag ggctcctctg tgcagacgcg 1440 
gacctgccac atccaggagt gtgacaagag atttaaacag gatggcggct ggagccactg 1500 
gtccccatgg tcatcttgct ccgtaacatg tggagacggt gtgatcacaa ggatccggct 1560 
ctgcaactcc cccagccccc agatgaatgg gaagccatgt gagggcaaag cccgggagac 1620 
caaagcctgc cagaaagact cctgccccat caatggaggc tggggacctt ggtcaccatg 1680 
ggacatctgt tctgtcacct gtggaggagg ggtacagaaa cgtagccggc tctgcaacaa 1740 
ccccacaccc cagtttggag gcaaggactg cattggtgat gtgacagaaa accagatctg 1800 
caacaagcag gactgtccca ttgacggatg cctgtccaat ccctgctttg ctggtgtcca 1860 
gtgtaccagc taccctgatg gcagctggaa gtgtggtgcc tgtcccccag gctatagtgg 1920 
agatggagtc gagtgcaaag acgttgatga gtgcaaagaa gtccctgatg cctgcttcaa 1980 
ccacaatgga gagcacaggt gtgagaacac agaccccggc tacaactgcc tgccctgccc 2040 
accgcccgga att 2053 



<210> 64 
<211> 4339 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 64 

agccactgcc tggagtcagc cagcctcatc ggacttctgc aggcaatcgc gaagctgcta 60 
tccagttctg ccacggtctc tcccggcgca ccggcagtct cagcgtcttc accggactca 120 
gcgtccttgt ccttcacttc acctttgcca cctctccggg ttactgagcc ccggtgcaca 180 
caggctccgt gttgggcaca aaggctccac catggagctc ctgcggggac taggtgtcct 240 
gttcctgttg catatgtgtg gaagcaaccg cattccagag tctgggggag ataacggtgt 300 
gtttgacatc tttgaactca ttggaggtgc acgaaggggc cccggtcgcc gactggtgaa 360 
gggccaagat ctatccagcc ccgccttccg cattgagaat gccaacctga tccccgctgt 420 
gccggatgac aagttccaag acctactgga cgctgtgtgg gccgacaaag gcttcatctt 480 
cttggcttcc ttgaggcaga tgaagaagac ccggggcaca ctcctggctg tggaacggaa 540 
agacaacact ggccagatct tcagtgtggt ctccaacggc aaagctggca ccctggacct 600 
gagcctgagc ctgccaggga agcaacaagt ggtgtcagtg gaggaagctc tcctggccac 660 
tggccagtgg aagagcatca cgctgtttgt tcaagaggac cgggctcaac tctacataga 720 
ctgtgataag atggagagcg cggagctgga tgtacccatc cagagcatct tcaccaggga 780 
tctggccagc gttgccaggc tccgagttgc aaagggagat gtcaatgaca attttcaggg 840 
ggtgctgcag aatgtgaggt ttgtctttgg aaccacccca gaagacattc tcaggaacaa 900 
aggctgctcc agctctacca acgtccttct tacccttgac aacaacgtgg tgaacggttc 960 
cagccctgct atccgcacca actacatcgg ccacaaaaca aaggacctcc aagctatctg 1020 



tggcctctcc tgtgatgaac tatccagcat ggtcctggaa ctgaagggcc tgcgcaccat 1080 
cgtgaccact ctgcaggaca gcatccgaaa agtgacggaa gagaacagag agctggtcag 1140 
tgagctgaag cggcctcccc tctgctttca caatggagtc cagtacaaga acaacgagga 1200 
gtggactgta gacagttgca cagagtgtca ctgccagaac tcggttacca tctgcaaaaa 1260 
ggtgtcctgt cccatcatgc cctgctccaa cgccacagtt cctgatggtg aatgctgccc 1320 
acggtgctgg cccagcgact ctgctgacga tggctggtct ccctggtctg agtggacctc 1380 
ctgctctgcc acatgtggca atggaattca gcaacgtggt cgttcctgtg acagcctcaa 1440 
caacagatgc gagggctctt cggtacagac gaggacctgc cacattcagg agtgtgacaa 1500 
aagatttaaa caggatggtg gctggagtca ctggtctcca tggtcgtcct gttctgtgac 1560 
ctgtggtgac ggtgtgatca caaggatccg tctctgcaac tcccccagcc cccagatgaa 1620 
cgggaagccc tgtgaaggtg aagcccggga gaccaaagcc tgcaagaaag acgcctgccc 1680 
aattaatgga ggctggggtc cctggtcacc atgggacatc tgctctgtca cctgtggagg 1740 
aggagtgcag agacgcagcc gactctgtaa caaccccaca ccccagtttg gaggcaaaga 1800 
ctgtgttggc gatgtgacag aaaatcaagt ttgcaacaag caggactgcc caattgatgg 1860 
atgcctgtcc aatccctgct ttgctggtgc caagtgtact agctaccctg atggtagctg 1920 
gaaatgtggt gcgtgtcctc ctggctacag tggaaatggc atccagtgca aagacgtcga 1980 
tgagtgcaaa gaagtgcctg atgcttgctt caatcacaac ggagaacatc ggtgcaagaa 2040 
cacagatcct ggctacaact gcctgccctg cccaccacga ttcactggct cacagccctt 2100 
cggccgaggt gtcgaacatg ccatggccaa caaacaggtg tgcaaaccgc gaaacccctg 2160 
cacggacggg acgcatgact gcaacaagaa cgctaagtgc aactacctgg gtcactacag 2220 
cgaccccatg taccgctgtg agtgcaagcc cggctatgca ggcaatggca tcatctgcgg 2280 
agaggacaca gacctggacg gctggcctaa tgaaaacctg gtgtgtgtgg ccaacgcaac 2340 
ctaccactgc aaaaaggaca actgccccaa ccttcccaac tcggggcagg aagactatga 2400 
caaggacggg attggcgatg cctgcgatga tgacgatgac aacgacaaga tccctgatga 2460 
cagggacaac tgtccattcc attacaaccc agcccagtat gactatgaca gagatgatgt 2520 
gggagaccgc tgtgacaact gcccctacaa ccacaaccct gaccaagcag acacagacaa 2580 
aaacggggag ggcgatgcct gtgctgtgga catcgatgga gatggaatcc tcaatgaacg 2640 
agacaactgc cagtacgttt acaacgtgga ccagagggac acggacatgg atggggttgg 2700 
agatcagtgt gacaactgcc ccctggaaca caatccagac cagctggact ctgactcaga 2760 
cctcataggg gacacttgtg acaacaatca ggacatcgat gaggatggcc atcagaacaa 2820 
cctggacaac tgtccctatg tgcctaacgc caaccaggcc gaccatgata aagatggcaa 2880 
aggagatgcc tgtgaccatg acgatgacaa tgacggcatc cctgatgaca gagacaactg 2940 
caggctggtg cccaatcctg accagaagga ctctgatggt gatggccgag gtgacgcctg 3000 
caaagacgac tttgaccatg acaatgtgcc agatattgat gacatctgtc ctgagaattt 3060 
tgacatcagt gaaaccgatt tccgacgatt ccagatgatt cctctagatc ccaaaggaac 3120 
ctcccaaaat gaccctaact gggttgtccg ccatcagggc aaagaactcg tccagactgt 3180 
aaactgtgac cctggacttg ctgtaggtta tgatgagttt aatgctgtgg acttcagcgg 3240 
taccttcttc atcaacaccg agagagatga tgactacgct ggcttggtat tcggctacca 3300 
gtccagcagc cgcttctacg ttgtgatgtg gaaacaagtc acccagtcct actgggacac 3360 
caaccccaca agggctcagg gatactcagg cctgtctgta aaggttgtga actccaccac 3420 
cggccctggc gagcacctgc ggaatgcact gtggcacaca ggaaacaccc ctggccaggt 3480 
gcgcaccctg tggcatgacc ctcgccacat cggctggaaa gatttcactg cgtacagatg 3540 
gcgtctcagc cacaggccaa agaccggtta tatcagagtg gtgatgtatg aaggaaagaa 3600 
aatcatggct gactcgggac ccatctatga caaaacctac gccggcggta gactaggcct 3660 
gttcgtcttc tctcaggaaa tggtgttctt ctcagacatg aaatacgagt gtcgagattc 3720 
ctaatcatca gctgccaatc ataaccagcg ctggcaatgc accttctaaa aacaagggct 3780 
agagaaaccc cccacccctg ccgggatcgc ctttcctcgc cttccttgcc tctcttcttg 3840 
catagtgtgg acttgtaaag cctgagacct gcctcaagaa aatgcagttt tcgaacccag 3900 
agtcagcact cggcctttaa cgaatgagaa tgcatcttcc aagaccatga agagttcctt 3960 
gggtttgctt ttgggaaagc caaagcgcct atttacttcc cactaggaag gtgcccgctc 4020 
cactctgcct tactcacaga gccagaactt cttcgaggcc acctctgagc agcacacaca 4080 
gaagcatttt caggcatgtc aaagaaagga aaaatgactc actagaactc accgccaaac 4140 
aacctctgac ataggtcctg agatgtgggg aggcaggagc caaagctcta gggagggcat 4200 
gtacccaaga gatgactgta tgaagaaaat gtggaggagc tgttcggtac taaatcattt 4260 
tcaggggaca gacagacttg ctgcatttcc gcatgctgct ggtgagagct gattgaccca 4320 
atcttccaca caggcactt 4339 



<210> 65 
<211> 186 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 65 

gcacagttaa tggaggctgg ggtccctggt caccatggga catctgctct gtcacctgtg 60 
gaggaggagt gcagagacgc agccgactct gtaacaaccc cacaccccag tttggaggca 120 
aagactgtgt tggcgatgtg acagaaaatc aagtttgcaa caagcaggac tgcccaattg 180 
gtaagc 186 

<210> 66 
<211> 5774 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 66 

gtcactttgg ttgatagcag ccgctctggt agaggttagg acttcagctg atggacaagc 60 
tggtaatgaa gaaatggtgc aaatagattt accaataaag agatatagag agtatgagct 120 
ggtgactcca gtcagcacaa atctagaagg acgctatctc tcccatactc tttcrgcgag 180 
tcacaaaaag aggtcagcga gggacgtgtc ttccaaccct gagcagttgt tctttaacat 240 
cacggcattt ggaaaagatt ttcatctgcg actaaagccc aacacrcaac tagtagctcc 300 
tggggctgtt gtggagtggc atgagacatc tctggtgcct gggaatataa ccgatcccat 360 
taacaaccat caaccaggaa gtgctacgta tagaatccgg aaaacagagc ctttgcagac 420 
taactgtgct tatgttggtg acatcgtgga cattccagga acctctgttg ccatcagcaa 480 
ctgtgatggt ctggctggaa tgataaaaag tgataatgaa gagtatttca ttgaaccctt 540 
ggaaagaggt aaacagatgg aggaagaaaa aggaaggatt catgttgtct acaagagatc 600 
agctgtagaa caggctccca tagacatgtc caaagacttc cactacagag agtcggacct 660 
ggaaggcctt gatgatctag gtactgttta tggcaacatc caccagcagc tgaatgaaac 720 
aatgagacgc cgcagacacg cgggagaaaa cgattacaat atcgaggtac tgcrgggagt 780 
ggatgactct gtggtccgtt tccatggcaa agagcacgtc caaaactacc tcctgaccct 840 
aatgaacatt gtgaatgaaa tttaccatga tgagtccctc ggagtgcata taaatgtggt 900 
cctggtgcgc atgataatgc tgggatatgc aaagtccatc agcctcatag aaaggggaaa 960 
cccatccaga agcttggaga atgtgtgtcg ctgggcgtcc caacagcaaa gatctgatct 1020 
caaccactct gaacaccatg accatgcaat ttttttaacc aggcaagact ttggacctgc 1080 
tggaatgcaa ggatatgctc cagtcaccgg catgtgtcat ccagtgagaa gttgtaccct 1140 
gaatcatgag gatggttttt catctgcttt tgtagtagcc catgaaacgg gccatgngtt 1200 
gggaatggag catgatggac aaggcaacag gtgtggtgat gagactgcta tgggaagtgt 1260 
catggctccc ttggtacaag cagcattcca tcgttaccac tggtcccgat gcagtggtca 1320 
agaactgaaa agatatatcc attcctatga ctgtctcctt gatgaccctt ttgatcatga 1380 
ttggcctaaa ctcccagaac ttcctggaat caattattct atggatgagc aatgtcgttt 1440 
tgattttggt gttggctata aaatgtgcac cgcgttccga acctttgacc catgtaaaca 1500 
gctgtggtgt agccatcctg ataatcccta cttttgtaag actaaaaagg gacctccact 1560 
tgatgggact gaatgtgctg ctggaaaatg gtgctataag ggtcattgca tgtggaagaa 1620 
tgctaatcag caaaaacaag atggcaattg ggggtcatgg actaaatttg gctcctgttc 1680 
tcggacatgt ggaactggtg ttcgtttcag aacacgccag tgcaataatc ccatgcccat 1740 
caatggtggt caggattgtc ctggtgttaa ttttgagtac cagctttgta acacagaaga 1800 
atgccaaaaa cactttgagg acttcagagc acagcagtgt cagcagcgaa actcccactt 1860 
tgaataccag aataccaaac accactggtt gccatatgaa catcctgacc ccaagaaaag 1920 
atgccacctt tactgtcagt ccaaggagac tggagatgtt gcttacatga aacaactggt 1980 
gcatgatgga acgcactgtt cttacaaaga tccatatagc atatgtgtgc gaggagagtg 2040 
tgtgaaagtg ggctgtgata aagaaattgg ttctaataag gttgaggata agtgtggtgt 2100 
ctgtggagga gataattccc actgccgaac cgtgaagggg acatttacca gaacucccag 2160 
gaagcttggg taccttaaga tgtttgatat accccctggg gctagacatg tgttaatcca 2220 
agaagacgag gcttctcctc atattcttgc tattaagaac caggctacag gccattatat 2280 
tttaaatggc aaaggggagg aagccaagtc gcggaccttc atagatcttg gtgtggagtg 2340 
ggattataac attgaagatg acattgaaag tcttcacacc gatggacctt tacatgatcc 2400 
tgttattgtt ttgattatac ctcaagaaaa tgatacccgc tctagcctga catataagta 2460 
catcatccat gaagactctg tacctacaat caacagcaac aatgtcatcc aggaagaatt 2520 
agatactttt gagtgggctt tgaagagctg gtctcaggtt tccaaaccct gtggtggagg 2580 
tttccagtac actaaatatg gatgccgtag gaaaagtgat aataaaatgg tccatcgcag 2640 
cttctgtgag gccaacaaaa agccgaaacc tattagacga atgtgcaata ttcaagagtg 2700 
tacacatcca ctctgggtag cagaagaatg ggaacactgc accaaaacct gtggaagttc 2760 
tggctatcag cttcgcactg tacgctgcct tcagccactc cttgatggca ccaaccgctc 2820 
tgtgcacagc aaatactgca tgggtgaccg tcccgagagc cgccggccct gtaacagagt 2880 
gccctgccct gcacagtgga aaacaggacc ctggagtgag tgttcagtga cctgcggtga 2940 
aggaacggag gtgaggcagg tcctctgcag ggctggggac cactgtgatg gtgaaaagcc 3000 
tgagtcggtc agagcctgtc aactgcctcc ttgtaatgat gaaccatgtt tgggagacaa 3060 
gtccatattc tgtcaaatgg aagtgttggc acgatactgc tccataccag gttataacaa 3120 
gttatgttgt gagtcctgca gcaagcgcag tagcaccctg ccaccaccat accttctaga 3180 
agctgctgaa actcatgatg atgtcatctc taaccctagt gacctcccta gatctctagt 3240 
gatgcctaca tctttggttc cttatcattc agagacccct gcaaagaaga tgtctttgag 3300 
tagcatctct tcagtgggag gtccaaatgc atatgctgct ttcaggccaa acagtaascc 3360 
tgatggtgct aatttacgcc agaggagtgc tcagcaagca ggaagtaaga ctgtgagact 3420 
ggtcaccgta ccatcctccc cacccaccaa gagggtccac ctcagttcag cttcacaaat 3480 
ggctgctgct tccttctttg cagccagtga ttcaataggt gcttcttctc aggcaagaac 3540 
ctcaaagaaa gatggaaaga tcattgacaa cagacgtccg acaagatcat ccaccttaga 3600 
aagatgagaa agtgaaccaa aaaggctaga aaccagagga aaacctggac aacctctctc 3660 
ttcccatggt gcatatgctt gtttaaagtg gaaatctcta tagatcgtca gctcatttta 3720 
tctgtaattg gaagaacaga aagtgctggc tcactttcta gttgctttca tcctcctttt 3780 
gttctgcatt gactcattta ccagaattca ttggaagaaa tcaccaaaga ttattacaaa 3840 
agaaaaatat gttgctaaga ttgtgttggt cgctctctga agcagaaaag ggactggaac 3900 
caattgtgca tatcagctga ctttttgttt gttttagaaa agttacagta aaaattaaaa 3960 
agagatacca atggtttaca ctttaacaag aaattttgga tatggaacaa agaattctta 4020 
gacttgtatt cctatttatc tatattagaa atattgtatg agcaaatttg cagctgttgt 4080 
gtaaatactg tatattgcaa aaatcagtat tattttaaga gatgtgttct caaatgattg 4140 
tttactatat tacatttctg gatgttctag gtgcctgtcg ttgagtattg ccttgtttga 4200 
cattctatag gttaattttc aaagcagagt attacaaaag agaagttaga attacagcta 4260 
ctgacaatat aaagggtttt gttgaatcaa caatgtgata cgtaaattat agaaaaagaa 4320 
aagaaacaca aaagctatag atatacagat atcagcttac crattgcctt ctatacttat 4380 
aatttaaagg attggtgtct tagtacactt gtggtcacag ggatcaacga atagtaaata 4440 
atgaactcgt gcaagacaaa actgaaaccc tctttccagg acctcagtag gcaccgttga 4500 
ggtgtccttt gtttttgtgt gtgtgtgttc ttttttaatt ttcgcattgt tgacagatac 4560 
aaacagttat actcaatgta ctgtaataat cgcaaaggaa aaagttttgg gataacttat 4620 
ttgtatgttg gtagctgaga aaaatatcat cagtctagaa ttgatatttg agtatagtag 4680 
agctttgggg ctttgaaggc aggttcaaga aagcatatgt cgatggttga gatatttatt 4740 
ttccatatgg ttcatgttca aatgttcaca accacaatgc atctgactgc aataatgtgc 4800 
taataattta tgtcagtagt caccttgctc acagcaaagc cagaaatgct ctctccaggg 4860 
agtagatgta aagtacttgt acatagaatt cagaactgaa gatatttatt aaaagttgat 4920 
ttttttttct tgatagtatt tttatgtact aaatatttac actaatatca attacatatt 4980 
ttggtaaact agagagacat aattagagat gcatgctttg ttctgtgcat agagaccttt 5040 
aagcaaacta ctacagccaa ctcaaaagct aaaactgaac aaatttgatg ttatgcaaac 5100 
atcttgcatt tttagtagtt gatattaagt tgatgacttg tttcccttca aggaaacatt 5160 
aaattgtatg gactcagcta gctgttcaat gaaattgtga attagaaaca tttttaaaag 5220 
tttttgaaag agataagtgc atcatgaatt acatgtacat gagaggagat agtgatatca 5280 
gcataatgat tttgaggtca gtacctgagc tgtctaaaaa tatattatac aaactaaaat 5340 
gtagatgaat taacctctca aagcacagaa tgtgcaagaa cttttgcatt ttaatcgttg 5400 
taaactaaca gcttaaacta ttgactctat acctctaaag aattgctgct actttgtgca 5460 
agaactttga aggtcaaatt aggcaaattc cagatagtaa aacaatccct aagccttaag 5520 
tctttttttt ttcctaaaaa ttcccataga ataaaattct ctctagttta cttgtgtgtg 5580 
catacatctc atccacaggg gaagataaag atggtcacac aaacagtttc cataaagatg 5640 
tacatattca ttatacttct gacctttggg ctttcttttc tactaagcta aaaattcctt 5700 
tttatcaaag tgtacactac tgatgctgtt tgttgtactg agagcacgta ccaataaaaa 5760 
tgttaacaaa atat 5774 

<210> 67 
<211> 5535 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 67 

ggactttaga agccgttgct gccctctctg tcacctgaag cggggccctc tcccatccca 60 
cccttgcccc gcctccctgc ccccaccggg ccggccctgc ccgccgccgg accctggcat 120 
gtcaagacct ggtccgcgcc tgcctgccca gcccgcggaa ccccggcggc cccgcgagct 180 
aggatgaggg gccaggccgc cgccccgggc cccgtctgga tcctcgcccc gctgctactg 240 
ctgctgctgc tgctgggacg ccgcgcgcgg gcggccgccg gagcagacgc ggggcccggg 300 
cccgagccgt gcgccacgct ggtgcaggga aagttcttcg gctacttctc cgcggccgcc 360 
gtgttcccgg ccaacgcctc gcgctgctcc tggacgctac gcaacccgga cccgcggcgc 420 
tacactctct acatgaaggt ggccaaggcg cccgtgccct gcagcggccc cggccgcgtg 480 
cgcacctacc agttcgactc cttcctcgag tccacgcgca cctacctggg cgtggagagc 540 
ttcgacgagg tgctgcggct ctgcgacccc tccgcacccc tggccttcct gcaggccagc 600 
aagcagttcc tgcagatgcg gcgccagcag ccgccccagc acgacgggct ccggccccgg 660 
gccgggccgc cgggccccac cgacgacttc tccgtggagt acctggtggt ggggaaccgc 720 
aaccccagcc gtgccgcctg ccagatgctg tgccgctggc tggacgcgtg tctggccggt 780 
agtcgcagct cgcacccctg cgggatcatg cagaccccct gcgcctgcct gggcggcgag 840 
gcgggcggcc ctgccgcggg acccctggcc ccccgcgggg atgtctgctt gagagatgcg 900 
gtggctggtg gccctgaaaa ctgcctcacc agcctgaccc aggaccgggg cgggcacggc 960 
gccacaggcg gctggaagct gtggtccctg tggggcgaat gcacgcggga ctgcggggga 1020 
ggcctccaga cgcggacgcg cacctgcctg cccgcgccgg gcgtggaggg cggcggctgc 1080 
gagggggtgc tggaggaggg tcgccagtgc aaccgcgagg cctgcggccc cgctgggcgc 1140 
accagctccc ggagccagtc cctgcggtcc acagatgccc ggcggcgcga ggagctgggg 1200 
gacgagctgc agcagtttgg gttcccagcc ccccagaccg gtgacccagc agccgaggag 1260 
tggtccccgt ggagcgtgtg ctccagcacc tgcggcgagg gctggcagac ccgcacgcgc 1320 
ttctgcgtgt cctcctccta cagcacgcag tgcagcggac ccctgcgcga gcagcggctg 1380 
tgcaacaact ctgccgtgtg cccagtgcat ggtgcctggg atgagtggtc gccctggagc 1440 
ctctgctcca gcacctgtgg ccgtggcttt cgggatcgca cgcgcacctg caggcccccc 1500 
cagtttgggg gcaacccctg tgagggccct gagaagcaaa ccaagttctg caacattgcc 1560 
ctgtgccctg gccgggcagt ggatggaaac tggaatgagt ggtcgagctg gagcgcctgc 1620 
tccgccagct gctcccaggg ccgacagcag cgcacgcgtg aatgcaacgg gccttcctac 1680 
gggggtgcgg agtgccaggg ccactgggtg gagacccgag actgcttcct gcagcagtgc 1740 
ccagtggatg gcaagtggca ggcctgggcg tcatggggca gttgcagcgt cacgtgtggg 1800 
gctggcagcc agcgacggga gcgtgtctgc tctgggccct tcttcggggg agcagcctgc 1860 
cagggccccG aggatgagta ccggcagtgc ggcacccagc ggtgtcccga gccccatgag 1920 
atctgtgatg aggacaactt tggtgctgtg atctggaagg agaccccagc gggagaggtg 1980 
gctgctgtcc ggtgtccccg caacgccaca ggactcatcc tgcgacggtg tgagctggac 2040 
gaggaaggca tcgcctactg ggagcccccc acctacatcc gctgtgtttc cattgactac 2100 
agaaacatcc agatgatgac ccgggagcac ctggccaagg ctcagcgagg gctgcctggg 2160 
gagggggtct cggaggtcat ccagacactg gtggagatct ctcaggacgg gaccagctac 2220 
agtggggacc tgctgtccac catcgatgtc ctgaggaaca tgacagagat tttccggaga 2280 
gcgtactaca gccccacccc tggggacgta cagaactttg tccagatcct tagcaacctg 2340 
ttggcagagg agaatcggga caagtgggag gaggcccagc tggcgggccc caacgccaag 2400 
gagctgttcc ggctggtgga ggactttgtg gacgtcatcg gcttccgcat gaaggacctg 2460 
agggatgcat accaggtgac agacaacctg gttctcagca tccataagct cccagccagc 2520 
ggagccactg acatcagctt ccccatgaag ggctggcggg ccacgggtga ctgggccaag 2580 
gtgccagagg acagggtcac tgtgtccaag agtgtcttct ccacggggct gacagaggcc 2640 
gatgaagcat ccgtgtttgt ggtgggcacc gtgctctaca ggaacctggg cagcttcctg 2700 
gccctgcaga ggaacacgac cgtcctgaat tctaaggtga tctccgtgac tgtgaaaccc 2760 
ccgcctcgct ccctgcgcac acccttggag atcgagtttg cccacatgta taatggcacc 2820 
accaaccaga cctgtatcct gtgggatgag acggatgtac cctcctcctc cgcccccccg 2880 
cagctcgggc cctggtcgtg gcgcggctgc cgcacggtgc ccctcgacgc cctccggacg 2940 
cgctgcctct gtgaccggct ctccaccttc gccatcttag cccagctcag cgccgacgcg 3000 
aacatggaga aggcgactct gccgtcggtg acgctcatcg tgggctgtgg cgtgtcctct 3060 
ctcaccctgc tcatgctggt catcatctac gtgtccgtgt ggaggtacat tcgctcagag 3120 
cgttctgtca tcctcatcaa cttctgcctg rccatcatct cctccaatgc cctcatcctc 3180 
atcgggcaga cccagacccg caacaaggtg atgtgcacgc tggtggccgc cttcctgcac 3240 
ttcttcttcc tgtcctcctt ctgctgggtg ctcaccgagg cctggcagtc ctacatggcc 3300 
gtgacgggcc acctccggaa ccgcctcatc cgcaagcgct tcctctgcct gggctggggg 3360 
ctccctgcac tggttgtggc catttctgtg ggattcacca aggccaaagg gtacagcacc 3420 
atgaactact gctggctctc cctggagggg ggactgctct atgccttcgt gggacctgcc 3480 
gctgccgttg tgctggtgaa catggtcatt gggatcctgg tgttcaacaa gctcgtgtcc 3540 
aaagacggca tcacggacaa gaagctgaag gagcgggcag gggcctccct gtggagctcc 3600 
tgcgtggtgc tgccgctgct ggcgctgacc tggatgtcgg ctgtgctcgc cgtcaccgac 3660 
cgccgctccg ccctcttcca gatcctcttc gctgtcttcg actcgctgga gggcttcgtc 3720 
atcgtcatgg tgcactgtat cctccgtaga gaggtccagg acgctgtgaa atgccgtgtg 3780 
gttgaccggc aggaggaggg caacggggac tcagggggct ccttccagaa cggccacgcc 3840 
cagctcatga ccgacttcga gaaggacgtg gatctggcct gtagatcagt gctgaacaag 3900 
gacatcgcgg cctgccgcac tgccaccatc acgggcacac tgaagcggcc gtctctgccc 3960 
gaggaggaga agctgaagct ggcccatgcc aaggggccgc ccaccaattt caacagcctg 4020 
ccggccaacg tgtccaagct gcacctgcac ggctcacccc gctatcccgg cgggcccctg 4080 
cccgacttcc ccaaccactc actgaccctc aagagggaca aggcgcccaa gtcctccttc 4140 
gtcggtgacg gggacatctt caagaagctg gactcggagc tgagccgggc ccaggagaag 4200 
gctctggaca cgagctacgt gatcctgccc acggccacgg ccacgctgcg gcccaagccc 4260 
aaggaggagc ccaagtacag catccacatt gaccagatgc cgcagacccg cctcatccac 4320 
ctcagcacgg cccccgaggc cagcctcccc gcccgcagcc cgccctcccg ccagcccccc 4380 
c^"3cggcgggc cccccgaggc accccctgcc cagcccccac cgcctccgcc cccaccgcca 4440 
ccacctcccc agcagcccct gcccccaccg cccaatctgg agccggcacc ccccagcctg 4500 
ggggatcccg gggagcctgc cgcccatccg ggacccagca cggggcccag caccaagaac 4560 
gagaatgtcg ccaccttgtc tgtgagctcc ctggagcggc ggaagtcgcg gtatgcagaa 4620 
ctggactttg agaagatcat gcacacccgg aagcggcacc aagacatgtt ccaggacctg 4680 
aaccggaagc tgcagcacgc agcggagaag gacaaggagg tgctggggcc ggacagcaag 4740 
ccggaaaagc agcagacgcc caacaagagg ccctgggaga gcctccggaa agcccacggg 4800 
acgcccacgt gggtgaagaa ggagctggag ccgctgcagc cgtcgccgct ggagcttcgc 4860 
agcgtggagt gggagaggtc gggcgccacg atcccgctgg tgggccagga catcatcgac 4920 
ctccagaccg aggtctgagc gggtgggcgg cggccacgca ctgggccacg gaggagggat 4980 
gctgctccgc ccgctcctgc cgcagacggg cacagacacg ctcgcgggca gcgggccagg 5040 
cccgcacccc ggcctcaggg cgctcagacg gcggccaggc acagggcccg cagtgctggg 5100 
accagagcca gatgcaggac aggaggcggc ccggccagcg ggcacagggc accagaggcc 5160 
gaaggtgcct cagactccgc cctcctcggg ccgaggccca gcgggcagat gggcggacgg 5220 
ctgtggaccg tggacaggcc cagcgcggcc agcgtcccag ggtacccgcc tgagctcccg 5280 
ctgcggagga gctgcctgct tggcccggcc ggcctggcac cgttttttaa acacccccat 5340 
ccctcgggaa gcagccagct ccccacacct tccagggccc taggcccctc ctagacccag 5400 
gtggagggca cagccctccg accctcatgg cccccagggg caggactgag tcccctccag 5460 
gaagaagcag gggggaatct attttttctc tccttttctt ttcttcaata aaaagaatta 5520 
aaaacccaaa aaaaa 5535 



<210> 68 
<211> 398 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 68 

cggggcaacc cgctggagtg gacgggccag gtgacggtgc gcaagaagcg caagccctac 60 
tccaagttcc agacgctcga gctcgagaag gagttcctct tcaacgcgta cgtcagcaag 120 
cagaagcgct gggagctggc gcgcaacctc aacctcaccg agcgccaggt caagatctgg 180 
ttccagaacc ggcgcatgaa gaacaagaag aacagccagc gccaggcggc cagcagcagc 240 
agcagcaaca gcagcagcag cagcagcagc aacagcagca agcggccgcc ggcggggcgt 300 
cggccgccgc caacggccac cagggccacc aagcgcacca ccacgcgccc cccaacggcg 360 
ccgtcgcagc cctcaagcac caccagtgac ccgtagcg 398 

<210> 69 
<211> 8670 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 69 

cccgggtgcg gtgtcgtgtg tggggctggg cgccatgttc ctggacatgc tgagggccaa 60 

gcgcgacacg gcgcccgacc gccgccagct ggacgaccgg atgatggggg cggacccggg 120 

ggacatagcg gccaaggtga gggcagggtt ttgcgtgcgt gcttgattgt gcgtgtgcgt 180 

gcgtgcgtgc gtgcgtgcgg tgttgcgtgt gtatttgaac tgtgttttgt gtatgtactt 240 

aggggtaaga gtgcatacac atgcatgcga ccggtggcct tacaaatcaa caacacgtac 300 
gcctgcatgt atccaggtgg cagcgtggcg acgagcacgt ggcttcgagg gcccaggcac 360 
ggcgggcccc agcggcagcg ccgccagtgg cagcggcgcc agcggctcgg caccgcaggc 420 
gcgctcgccc cgacctcagc caccgcggcc gcgctcacct tcacgcgggt gaaccccggc 480 
gaggagccgc ccgtgtacgc gtgcgagcaa acaggtgcgt aagcgacgtg tgggcagcgc 540 
gaagaggcgt gggggcgaga gagcaaaggg actagggaaa cgcacagcca aatacggtat 600 
gcgggcaacg aggcgatggc cctggaaatc gcagggccct tttgaaatcg tgtaaggcgc 660 
aattgctggg cgactaccgt agtctactga tgcattgcac tacttgtatt actgtatcct 720 
actgcagtag tgccgttgcc agccgcgctg ctgccctttg gctcccttcc caatccaaat 780 
ggcccatgcc tcgcgcactc cgagcaccca gagcacccag aagccgttgc gtgcgctccg 840 
ccgccgccct ctcccccgcc ttcacttctt aattaatcgt gaatgtaatc cccccccccc 900 
ccgcttcctc aggctgggtg cacgtgtgcg cgacgcctgc acggagggtg tggtggatgc 960 
ccgcagcgaa ctgctggtgt gcccggtgag tcgacgagga ggaggtgcaa gggggatacc 1020 
agcgcgtgtt tctcagggcc tgtgtgggac accgaaacgt ggtaaaagag acccgcccgc 1080 
gaactgtgta tgtggagtag cgtggcgtgt gcggccggac cgacaaggca gcttgtggac 1140 
tgccccacgt tgcagagtca gctgacaacg acacgtgcgc cttcctgtca ttgcccgtgc 1200 
gcacgcacgt cctccgcact cccaacaaat tgacagcgac acgtgcgcct tcctataagc 1260 
Gtatgcccgc acacgctccc gcgccctcag gtgtcgggcc agaccacaga ccggttggtc 1320 
cacgagtgcg aggaggatga ggcgggcggc tgcggcggcg ccggcggggc gccgcggcga 1380 
ggaggacggc ctgggactgg gcatcacagg tgggtggcag gctggcaggg actcacgcat 1440 
gggccttgta cgtgactgcg gttctgcatg gctagtggct cacgcgctgc gcacgttcac 1500 
gtacggcttg tgggcatgca gtgccttgac gtgaggctgc gctgccttgc tgctgccgcc 1560 
ttgccccgct ccctgcacac actgcagccg gcttcgggcg ctacttcacc gcgggctacg 1620
agtgcgagaa cgcgcagcag ctcaacaggc tgctggggta caaggcgctg tgagagcgcg 1680
ccgcaggggg agtgtgttca tattgtggtt gtttgggccg tgggcgcggg ctgcatgtgc 1740
gtattgcacg cgtacagcat tggtgactgg tcaggtgtaa gcggccggca gtgcgccgcg 1800
aggcgctgca gcgagttgtg gggcatgcgt catgcgcaga cggcccctgg acgacaaggc 1860
gttgagttgg cgtttggagg tgtgggacga cgtggggttt gtgccgtcaa agcacagaac 1920
agaaggcgtg accgttttac gagctcgtat gatgtagcat ggattgaata atgacatgtg 1980
atttttgtta caagcgacga atgcgtgggg ttttggatgg caggggtttc agtcgcccga 2040
ttgcgcatgc acacgtgacc aaatttatgc tcaacgacgt gaccattgct ttatacatac 2100
ttgtgtatcg gttggcactt ataacaattg gctcgtcaaa ttgacgcgag gctgcacttc 2160
gatcctgaaa gccccagttc aacaagtcgg atagccaaat ggccccgctc gcrctccagc 2220
atcaaggggc ctctaagtgc ctcgcggcaa cccagcgcaa gtgtgctcgc gttgcggtga 2280
gctggactcg tgcacttgtc gacgccgtcg gcaccgcaat cgaaagacgc gtgcgtcgag 2340
caattgtgga agccgctgac gaattgtccg catgtgacat tgcaggctcg cgtccccgct 2400
cgtctcagcg tcatggccca ggtgcggacg ttgggactgc acttgcacga atgtgatggg 2460
gccgcaccga gtctgcgcgg acgtctcgct gacgtttcgc gttgaatgca tctcgcaata 2520
ggcagctgct gcgcctgctg acaacactaa gaagctgtgg ggcggtcgct tcacgggcaa 2580
gacggacccg ctcatggaga agttcaacga gtcgctgccc tttgacaagc gcctgtgggc 2640
tgaggacatc aaggtgcggc acagggaggg gggcgagtgg tggggtgggg ctggggggga 2700
cgcgggtttg gtggccaggg cagggaggga agacgtgcgg ggctaggcaa gaggctgcga 2760
gggcccaggg taacaccaga ccgtgccgtg tcgcgtgccc ggcttgctgc ccaccttgcc 2820
cggccatccc caccgccctc cccaccagca atgacacgta cacattcaca cactccccca 2880 
cacccacata cccacacacc cacgcattcc ccaacagggc agccaggcgt acgccaaggc 2940 
tcttgccaag gccggcattc tggcacatga cgaggccgtg accattgtgg aggggctggc 3000 
caaggtgcgc acacccggca gcagggcggg tgggtgggtg ggtggggtgg gggggcagag 3060 
agaggcgcgg gctgagaggg ggctgagagg ggggtcagcg aggcgcaggc tcagggggag 3120 
gcgtctgagg ggggctgaga tggtggtggg ggagctgcgg gtgctggggc tgctgcggtg 3180 
gcgggcgggc gggcgggcgg gcgacgtgta cgtgagtagc cgctgaccgg gcgctgggcc 3240 
tttgcgcacg ccacagccca catgacaccg ccgcaaggcc cgccgcgccc cacccacgtt 3300 
cacacactcc ccacacccac gcgtgcgcgc gcctccttcc cctcaataca cgcgcctcct 3360 
tcccctggcc cccgcctgct ccccccatcc ggccgccccg cctgcaggtg gctgaggagt 3420 
ggaaggcggg tgcctttgtg atcaaggcgg gtgacgagga catccacacg gccaacgagc 3480 
ggcgcctcac ggagctggtg ggggcggtgg gcggcaagct gcacaccggc cgctcgcgca 3540 
acgaccaggt gagggtgggt gggtgggggt ggggtgggtg ggtgggtggg tgggtgggtg 3600 
ggtgggtggg tgggtgggtg ggtgggtggg ggtttgagat accggtacca ggccaaacta 3660
aaccgaaccc aagggggtgg cgtaggggcg tgggaggggg ggagtgcgga agccgggagg 3720
caggagtaag ggcgggagga gggggccgga ggagaagcag ggacgaagtc gatgacaggc 3780
gcagtcggtg gcggcggtgg cgggtgtgcc gttgtgcagt ggctgtggag gccatgtgca 3840 
gggcggcggc ggggccgggc cgggggtggg agacttgtcc agaccccgtg gccctcttcc 3900 
agccccgtcc gccactgccg ccaccaccac cgccgccgcc gtagccacca cccctcacgt 3960 
cgaggcactt cacagatgcg aagcaaccac accgttctcc acatgaacag ctaccctccc 4020 
aaacccaact ttcccttccc gccttaccta accatgaccc gctacccccc ccccctttat 4080
ttcttaacta accatgaatg cccccccccg gctgtacctg gctacgactt cacttcgtaa 4140 
acttaatgtg tgtaaccccc cttacacaca cacacacacc cctccccgcc cctccaaagg 4200 
ttgccaccga ctaccggctg tggctggtgg gtcaggtgga ggtgatgcgg tccgaggrgg 4260 
gcgagctgat gcgcgtggcg gcggaccgct ccgaggcaga ggtggaggtg ctcatgccgg 4320 
gtgagggggc agggaggggg ggagggggag ggggaggtgc tcatgccggt gagggtaggg 4380 
aggggagggg cagaggaggg agggggagga gggggcggct gagtgcggga gaggcaggga 4440
tgagggcgat agaaagttgc gtattgtcgg taaactcaaa ggactagacg aagagaacaa 4500 
acctaaacaa gggagctgga gcgaggccaa atctgaacgt gacatcgccc gcctcctccc 4560 
gctgcctgct cccccacctc ctcccccatc tcgccccccc ccccacacac acacaggctt 4620 
cacgcacctt cagaatgcca tgactgtgcg ctggagccac tggctgatga gccacgccgc 4680
ggcctggcag cgcgacgaca tgcggctgcg ggacctgctg ccgcgggtgg ccacactgcc 4740 
gctgggctcg ggtgggtgag ggaggggagg ggaggggagg gggggagggg gagggagagg 4800 
aggggagaag ggggggggag acgaggaggg tggaagggtg ggggcggggc ggtggaggct 4860
agagggtggg gctgggtggg tggacggagt gcactggtag aggagggata gggtacattg 4920
agacgggagg agggatgcag gggcgaaggt ggggaggagg ggaggggagg aggcgtggag 4980
ctggagtggg ccgacgagtg tgcggacggg gcaggcggca acggggatta aacggcgggg 5040
ggccggggcg tgtgcacgac aggggcttgc gcgtctgcga ttgtgggggc acacagggac 5100
aggagcacga cgtgggacac gcatagatac gccgcattga caacacacac acacacacac 5160 
acacacacac acacacacac acacacacaa acacaaacac acacaaacac aaacacacac 5220 
acgccccccc ccctacacac acgccccctc cccaggcgcc ctggccggca acccctttct 5280
ggtggaccgc cagttcatcg ccaaggagtt gggtttcggc ggcggcgtgt gccccaactc 5340 
catggacgcg gtgaggggag gaggaggggg aggagggcgg gggggggcag gaggggggag 5400 
gaggaggggg ggagggggtt aactttgaag cgtaaggaaa cagtcgggag gaggggggga 5460
^ggagggggc ctggaggagg gggggaggag gagggtggct ggagggggct gggggaggag 5520 
gagggggagg attgggaggg ggctggggga gggtgcccgc agctggggga ggtggggagg 5580 
gagggggttg ctgctggtgt aaagggcctg taggcactga gagcactgtg gggagccggg 5640 
gtactgcctg gggccccgcg ctgcagaggt gtcgcgcagt gtggcggcgc atcccccgca 5700 
tccccacacg cgggccgctg ccgctgcccg ccacaccctt gccactttgt gtgctttcct 5760
aggatataca cacacacaca cacacacaca cacacacaca cacacacaaa cacaaacaca 5820 
cacgggcgcg ggctttcgtt tcgtttttta acacaaacac acactccccc tgtgctcctc 5880 
aacacactcc atctttctca cacaaacaca cacgcacaca cacatgcgca ggtgtctgac 5940 
cgcgactttg tgatcgagac ggtgtttgcg gccagcctgc tgtgcgtgca cctgtcgcgc 6000 
tgggcggagg acctcatcat ctacagctcc ggccccttcg gctacgtgca gtgcagcgac 6060 
gcctacgcca ccggctcctc gctcatgccg cagaagaaga accccgacgc cctggagctc 6120 
atcaggtgcg ggagggatgg ggtgggggtg ggggggttac attcatggtt agttaagaag 6180 
tgaaggcgta gggggtggat ggggtgggtt acattcatga acatttaaga agtgaaggcg 6240
tagccaggaa cagtagtaga gcagacgcgt tgtagtgtgt gggtttgggt gggagggatg 6300 
gttgggtaaa gcggtacagg atgtactgag gactgcagac cgaaggagcg ggggaggggg 6360 
agcaggcagg cggggcgagg ggcgtggggg cgggggttac tggcaccgtg ccgggtaagc 6420 
aacacgtgac acggagatgc accacacaaa gagggacgtg gggagtggca ggcgggggcc 6480




agggctgaga ggcgcgtgtg gaggggtgcg gggttgggcg gggggctgtt tcatgatacc 6540 
gctgcctcca cctcctccac cgcctcctgc cacctccacc tcccccactg cccctccccg 6600 
cctcctcctg ctgcaggggc aagggcggtc gtgtgcaggg caacctgatg ggcgtcatgg 6660 
cggtgctcaa gggcacgccc accacataca acaaggactt ccaggcgaga gagcgagagc 6720 
gagggaggga gggagagcga gggagaggga gggagaggga gggagaggga gacagaggga 6780
cagggacagg gacagggaca gggacaggga cagggacagg ggcaggggca ggggcagggg 6840
caggggcagg ggcaggggag gccccccggg ggcggcgggc ccggggcatg aggtcagaca 6900
taggggcgct gcactgaggc cgcgaggcgg gcgggaggca gggggcgggg ggcggggggc 6960
gggagcggac atgcgccgca aacacagacg ggttgagaaa gcacaacgac tggaacgcag 7020 
tgggcttact gacaattcat cattgtgcgc atatgtgtgt atgtgtatgt gtgtgtttgt 7080 
ttgtgcagga gtgttgggag ctgctgtttg acacggtgga cacggtgcac gacgtggtgc 7140 
gcatcgccac cggcgtgctg tccaccctgc ggatcaagcc cgaccgcatg aaggccggtg 7200 
agcgtagccg agcagggctg gagcagcagc cgggcagcag tagcagcagg gcaggggagc 7260 
agcgggagcg ggagcagcag gaggggtggt tgggaagcgg tgggggtagg gtgggagcgg 7320 
aggaagggaa ggaggagcag gagcaggagg aagaggagga ggaagggcgg tggggggtgg 7380 
ggggtcgtgt ccttggccgc atgggcggag gcggggaggc ggggaggagg cggggaagca 7440 
gagcctgcac ccacgctccg cgggtcccta ccgtcttgcg cctaaccccg tgcgcctagc 7500
ctcttgcgcc caccccctta gtgcatcctg tacccctctt tccaaacatc cttgcaactc 7560 
cctgacctcc tcgccaaacc tcccccgccc ccaggcctgt ccgccgacat gctcgccacg 7620 
gacttggccg agtacctggt gcgcaagggc gtgccgttcc gggagacaca ccaccacagg 7680 
tgcggccggg cgggagggcg tgagggcgtg ggtggggcat gcccggggtt gtgagagcta 7740
tcgaacgttg tgccgcgcct gtttcacaat gtcgggccac agggtatgca gtttcctctc 7800 
catatgtata acaaactgac caccaatcat gcacgctcac acgctctccc acacacacgc 7860 
gcaccacgcc accacagcgg cgccgccgtg aagatggccg aggaccgcgg ctgcacgctg 7920
ttcgacctca ccgtggacga cctcaagacc atccacccgc tcttcaccga cgacgtggcg 7980
gcggtgagcg gcggcgcgga gcagcagcag cagcagcagc agcagcagca gcagcagcag 8040
tagcctgggg gggagcgtgt gggaggaacg gcgggggagg ggaggcgggg ggtgtcgttt 8100
gcagccgagc gcacgtggtg ctttgcccca ttccatgcca gcagggtgac acacctgacc 8160 
atgctggtgt gctgctaggt ggttcacacc tacgtgtgaa tttgtgctgg cgtgcgcaca 8220 
ccttactgtg gccatgtgaa cggcatcctc atgtcctcgt gattgcgccc ggcacattgc 8280 
ccacaacccc gcaccaccca gctcctcaat ccagtgcaag gaaaggaaat gcacgcccgc 8340 
cgcaccaaca acacgacgca tgtgtttgcc acgtgcgcgc acacacgcgc aggtgtggga 8400
cttcaaccgc agcgccgaga tgcgcgacac ggagggcggc accagcaagc gctcggtgct 8460
ggagcaggtg cagaagatgc gcacctacct ggcggcggag ggacagcact gagcgggtcg 8520 
ggggaggggg ggcgggtgtg tatgtgtgtg tgtgtgcgtg tgtaagtctc ggtggagggg 8580 
tggtcctcta tatggcggcg gggccacagg gggacgggtg tgacagagti: acggccggag 8640
ccagcggagt cccgggatgg attaaggatc 8670 



<210> 70 
<211> 745 
<212> DNA 
<213> Unknown 



<220> 
<223> Description of Unknown Organism: Unknown
<400> 70 

atgagatggc gacgcgcccc gcgccgctcc gggcgtcccg gcccccgggc ccagcgcccc 60 
ggctccgccg cccgctcgtc gccgccgctg ccgctgctgc cactactgct gctgctgggg 120 
accgcggccc tggcgccggg ggcggcggcc ggcaacgagg cggctcccgc gggggcctcg 180 
gtgtgctact cgtccccgcc cagcgtggga tcggtgcagg agctagctca gcgcgccgcg 240 
gtggtgatcg agggaaaggt gcacccgcag cggcggcagc agggggcact cgacaggaag 300 
gcggcggcgg cggcgggcga ggcaggggcg tggggcggcg atcgcgagcc gccagccgcg 360 
ggcccacggg cgctggggcc gcccgccgag gagccgctgc tcgccgccaa cgggaccgtg 420
ccctcttggc ccaccgcccc ggtgcccagc gccggcgagc ccggggagga ggcgccctat 480 
ctggtgaagg tgcaccaggt gtgggcggtg aaagccgggg gcttgaagaa ggactcgctg 540 
ctcaccgtgc gcctggggac ctggggccac cccgccttcc cctcctgcgg gaggctcaag 600 
gaggacagca ggtacatctt cttcatggag cccgacgcca acagcaccag ccgcgcgccg 660 
gccgccttcc gagcctcttt cccccctctg gagacgggcc ggaacctcaa gaaggaggtc 720 
agccgggtgc tgtgcaagcg gtgcg 745



<210> 71 
<211> 1986 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 71 

gaattccttt tttttttttt ttttttcttt ttttttttgc ccttatacct cttcgccttt 60 
ctgtggttcc atccacttct tccccctcct cctcccataa acaactctcc tacccctgca 120
cccccaataa ataaataaaa ggaggagggc aaggggggag gaggaggagt ggtgctgcga 180
ggggaaggaa aagggaggca gcgcgagaag agccgggcag agtccgaacc gacagccaga 240
agcccgcacg cacctcgcac catgagatgg cgacgcgccc cgcgccgctc cgggcgtccc 300
ggcccccggg cccagcgccc cggctccgcc gcccgctcgt cgccgccgct gccgctgctg 360
ccactactgc tgctgctggg gaccgcggcc ctggcgccgg gggcggcggc cggcaacgag 420
gcggctcccg cgggggcctc ggtgtgctac tcgtccccgc ccagcgtggg atcggtgcag 480
gagctagctc agcgcgccgc ggtggtgatc gagggaaagg tgcacccgca gcggcggcag 540
cagggggcac tcgacaggaa ggcggcggcg gcggcgggcg aggcaggggc gtggggcggc 600
gatcgcgagc cgccagccgc gggcccacgg gcgctggggc cgcccgccga ggagccgctg 660
ctcgccgcca acgggaccgt gccctcttgg cccaccgccc cggtgcccag cgccggcgag 720
cccggggagg aggcgcccta tctggtgaag gtgcaccagg tgtgggcggr gaaagccggg 780
ggcttgaaga aggactcgct gctcaccgtg cgcctgggga cctggggcca ccccgccttc 840
cgctcctgcg ggaggctcaa ggaggacagc aggtacatct tcttcatgga gcccgacgcc 900
aacagcacca gccgcgcgcc ggccgccttc cgagcctctt tcccccctct ggagacgggc 960
cggaacctca agaaggaggt cagccgggtg ctgtgcaagc ggtgcgcctt gcctccccaa 1020
ttgaaagaga tgaaaagcca ggaatcggct gcaggttcca aactagtcct tcggtgtgaa 1080
accagttctg aatactcctc tctcagattc aagtggttca agaatgggaa tgaattgaat 1140
cgaaaaaaca aaccacaaaa tatcaagata caaaaaaagc cagggaagtc agaacttcgc 1200
attaacaaag catcactggc tgattctgga gagtatatgt gcaaagtgat cagcaaatta 1260
ggaaatgaca gtgcctctgc caatatcacc atcgtggaat caaacgctac atctacatcc 1320
accactggga caagccatct tgtaaaatgt gcggagaagg agaaaacttt ctgtgtgaat 1380
ggaggggagt gcttcatggt gaaagacctt tcaaacccct cgagatactt gtgcaagtgc 1440
ccaaatgagt ttactggtga tcgctgccaa aactacgtaa tggccagctt ctacagtacg 1500 
tccactccct ttctgtctct gcctgaatag gagcatgctc agttggtgct gctttcttgt 1560 
tgctgcatct cccctcagat tccacctaga gctagatgtg tcttaccaga tctaatattg 1620 
actgcctctg cctgtcgcat gagaacatta acaaaagcaa ttgtattact tcctctgttc 1680 
gcgactagtt ggctctgaga tactaatagg tgtgtgaggc tccggatgtt tctggaattg 1740 
atattgaatg atgtgataca aattgatagt caatatcaag cagtgaaata tgataataaa 1800 
ggcatttcaa agtctcactt ttattgataa aataaaaatc attctactga acagtccatc 1860 
ttctttatac aatgaccaca tcctgaaaag ggtgttgcta agctgtaacc gatatgcact 1920 
tgaaatgatg gtaagttaat tttgattcag aatgtgttat ttgtcacaaa taaacataat 1980 
aaaagg 1986 

<210> 72 
<211> 2003 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<220> 

<221> UNSURE 
<222> (31) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 



<222> (32) 

<223> May be any nucleic acid 
<400> 72 

ggaattcctt tttttttttt tttttttctt nntttttttt tgcccttata cctcttcgcc 60
tttctgtggt tccatccact tcttccccct cctcctccca taaacaactc tcctacccct 120
gcacccccaa taaataaata aaaggaggag ggcaaggggg gaggaggagg agtggtgctg 180
cgaggggaag gaaaagggag gcagcgcgag aagagccggg cagagtccga accgacagcc 240
agaagcccgc acgcacctcg caccatgaga tggcgacgcg ccccgcgccg ctccgggcgt 300
cccggccccc gggcccagcg ccccggctcc gccgcccgct cgtcgccgcc gctgccgctg 360
ctgccactac tgctgctgct ggggaccgcg gccctggcgc cgggggcggc ggccggcaac 420
gaggcggctc ccgcgggggc ctcggtgtgc tactcgtccc cgcccagcgt gggatcggtg 480
caggagctag ctcagcgcgc cgcggtggtg atcgagggaa aggtgcaccc gcagcggcgg 540
cagcaggggg cactcgacag gaaggcggcg gcggcggcgg gcgaggcagg ggcgtggggc 600
ggcgatcgcg agccgccagc cgcgggccca cgggcgctgg ggccgcccgc cgaggagccg 660
ctgctcgccg ccaacgggac cgtgccctct tggcccaccg ccccggtgcc cagcgccggc 720
gagcccgggg aggaggcgcc ctatctggtg aaggtgcacc aggtgtgggc ggtgaaagcc 780
gggggcttga agaaggactc gctgctcacc gtgcgcctgg ggacctgggg ccaccccgcc 840
ttcccctcct gcgggaggct caaggaggac agcaggtaca tcttcttcat ggagcccgac 900
gccaacagca ccagccgcgc gccggccgcc ttccgagcct ctttcccccc tctggagacg 960
ggccggaacc tcaagaagga ggtcagccgg gtgctgtgca agcggtgcgc cttgcctccc 1020
caattgaaag agatgaaaag ccaggaatcg gctgcaggtt ccaaactagt ccttcggtgt 1080
gaaaccagtt ctgaatactc ctctctcaga ttcaagtggt tcaagaatgg gaatgaattg 1140
aatcgaaaaa acaaaccaca aaatatcaag atacaaaaaa agccagggaa gtcagaactt 1200
cgcattaaca aagcatcact ggctgattct ggagagtata tgtgcaaagt gatcagcaaa 1260 
ttaggaaatg acagtgcctc tgccaatatc accatcgtgg aatcaaacgc tacatctaca 1320 
tccaccactg ggacaagcca tcttgtaaaa tgtgcggaga aggagaaaac tttctgtgtg 1380 
aatggagggg agtgcttcat ggtgaaagac ctttcaaacc cctcgagata cttgtgcaag 1440 
tgcccaaatg agtttactgg tgatcgctgc caaaactacg taatggccag cttctacagt 1500 
acgtccactc cctttctgtc tctgcctgaa taggagcatg ctcagttggt gctgctttct 1560 
tgttgctgca tctcccctca gattccacct agagctagat gtgtcttacc agatctaata 1620 
ttgactgcct ctgcctgtcg catgagaaca ttaacaaaag caattgtatt acttcctctg 1680 
ttcgcgacta gttggctctg agatactaat aggtgtgtga ggctccggat gtttctggaa 1740
ttgatattga atgatgtgat acaaattgat agtcaatatc aagcagtgaa atatgataat 1800 
aaaggcattt caaagtctca cttttattga taaaataaaa atcattctac tgaacagtcc 1860 
atcttcttta tacaatgacc acatcctgaa aagggtgttg ctaagctgta accgatatgc 1920 
acttgaaatg atggtaagtt aattttgatt cagaatgtgt tatttgtcac aaataaacat 1980 
aataaaagga aaaaaaaaaa aaa 2003 

<210> 73 
<211> 957 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<220> 

<221> UNSURE 
<222> (809)

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (810) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (811) 

<223> May be any nucleic acid 



<400> 73 

tctcgcccca actttttccc ccgcgctccg cagcagcagc agcagcagca gcagcagcag 60 
caaaatggca gacctcttca gcggactcgt gggcggcgtc gtcggcgctg ttgctgcagc 120 
agatttgcct gcggagggcg agagggcccc ccgccccgcc cccggcactg cctggacttg 180
ctgctgcagc aaactgcaag aaggggcccg cgagctggag ggttttgtgc agcagctgag 240
ttttgttgca gggaagctgg cctgctgcct gcgggtgggg gcggagcagc tggcgcgctg 300 
cgctgcggag gggcggctgc ccagcagcag cagcagcagc agctgctgcg cgctgctgca 360 
gctcgagaag caggacctcg agcagagcct cgaggccggc aagcagggcg cggagtgcct 420 
cttgaggagc agcaaactgg ccctcgaggc cctcctcgag ggggcccgcg ttgcagcaac 480 
gcggggtttg ctgctggtcg agagcagcaa agacacggtg ctgcgcagca ttccccacac 540
ccaggagaag ctggcccagg cctacagttc tttcctgcgg ggctaccagg gggcagcagc 600 
ggggaggtct ctgggctacg gggcccctgc tgctgcttac ggccagcagc agcagcccag 660 
cagctacggg gcgccccccg cctccagcca gcagccctcc ggcttcttcr ggtagccctg 720 
cagcagcagc agcagcagca gcagcagcag cagcgcgggc ggcagccgcg gcggggccgg 780 
ggcgccgctg cagcaacagc agcagccgnn ncggctagcg ccgcggagca ctcgcaggga 840
actccacagg cagcgggaga gcagcaggga cgagaagcag gtcatgtagc gcaggcagca 900
gcgccagctg cagcagcagc agcagcagca gcagcagcag cagcagctcc tgcaccg 957



<210> 74 
<211> 957 
<212> DNA 
<213> Unknown 



<220> 

<223> Description of Unknown Organism: Unknown 
<220> 

<221> UNSURE 
<222> (809) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (810) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (811) 

<223> May be any nucleic acid 



<400> 74 

tctcgcccca actttttccc ccgcgctccg cagcagcagc agcagcagca gcagcagcag 60
caaaatggca gacctcttca gcggactcgt gggcggcgtc gtcggcgctg ttgctgcagc 120
agatttgcct gcggagggcg agagggcccc ccgccccgcc cccggcactg cctggacttg 180
ctgctgcagc aaactgcaag aaggggcccg cgagctggag ggttttgtgc agcagctgag 240
ttttgttgca gggaagctgg cctgctgcct gcgggtgggg gcggagcagc tggcgcgctg 300
cgctgcggag gggcggctgc ccagcagcag cagcagcagc agctgctgcg cgctgctgca 360
gctcgagaag caggacctcg agcagagcct cgaggccggc aagcagggcg cggagtgcct 420
cttgaggagc agcaaactgg ccctcgaggc ccrcctcgag ggggcccgcg ttgcagcaac 480 
gcggggtttg ctgctggtcg agagcagcaa agacacggtg ctgcgcagca ttccccacac 540 
ccaggagaag ctggcccagg cctacagttc tttcctccgg ggctaccagg gggcagcagc 600 
ggggaggtct ctgggctacg gggcccctgc tgctgcttac ggccagcagc agcagcccag 660 
cagctacggg gcgccccccg cctccagcca gcagccctcc ggcttcttct ggtagccctg 720 
cagcagcagc agcagcagca gcagcagcag cagcgcgggc ggcagccgcg gcggggccgg 780 
ggcgccgctg cagcaacagc agcagccgnn ncggctagcg ccgcggagca ctcgcaggga 840
actccacagg cagcgggaga gcagcaggga cgagaagcag gtcatgtagc gcaggcagca 900 
gcgccagctg cagcagcagc agcagcagca gcagcagcag cagcagctcc tgcaccg 957 



<210> 75 
<211> 1089 
<212> DNA 
<213> Unknown 



<220> 

<223> Description of Unknown Organism: Unknown
<220> 

<221> UNSURE 
<222> (376) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (377) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (847)

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (848) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (849) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (850) 

<223> May be any nucleic acid 



<400> 75 

gaattccctc caactcttcg cgactctctc tctctcgccc caactttttc ccccgcgccc 60 
cgcagcagca gcagcagcag cagcagcaaa atggcagacc tcttcagcgg actcgtgggc 120 
ggcgtcgtcg gcgctgttgc tgcagcagat ttgcctgcgg agggcgagag ggccccccgc 180 
cccgcccccg gcactgcctg gacttgctgc tgcagcaaac tgcaagaagg ggcccgcgag 240
ctggagggtt ttctgcagca gctgagtttt gttgcaggga agctggcctg ctgcctgcgg 300 
gtgggggcgg agcagctggc gcgctgcgct gcggaggggc ggctgcccag cagcagcagc 360 
agcagcagct gctgcnngct gctgcagctc gagaagcagg acctcgagca gagcctcgag 420 
gccggcaagc agggcgcgga gtgcctcttg aggagcagca aactggccct cgaggccctc 480 
ctcgaggggg cccgcgttgc agcaacgcgg ggtttgctgc tggtcgagag cagcaaagac 540 
acggtgctgc gcagcattcc ccacacccag gagaagctgg ctcaggccta cagttctttc 600 
ctgcggggct accagggggc agcagcgggg aggtctctgg gctacggggc ccctgctgct 660 
gcttacggcc agcagcagca gcccagcagc tacggggcgc cccccgcctc cagccagcag 720
ccctccggct tcttctggta gccctgcagc agcagcagca gcagcagcag cagcagcagc 780 
ggcggcggca gccgcggcgg ggccggggcg ccgctgcagc aacagcagca gccgcggcgg 840 
ctagcgnnnn gagcactcgc agggaactcc acaggcagcg ggagagcagc agggacgaga 900 
agcaggtcta tgtagcgcag gcagcagcgc cagctgcagc agcagcagca gcagcagcag 960 
cagcagcagc agctcctgca ccgcagcgtt gtgtcattta ttacgttggc agctctgagg 1020 
gctcggcgca gccaacgcgc ctcaggtatc tttcagactc ttttctctaa ggtcttccag 1080
acggaattc 1089 

<210> 76 
<211> 1985 
<212> DNA 
<213> Unknown

<220>

<223> Description of Unknown Organism: Unknown
<400> 76 

cgccgagctt tcggcacctc tgccgggtgg taccgagcct tcccggcgcc ccctcctctc 60 
ctcccaccgg cctgcccttc cccgcgggac tatcgccccc acgtttccct cagccctttt 120 
ctctcccggc cgagccgcgg cggcagcagc agcagcagca gcagcaggag gaggagcccg 180 
gtggcggcgg tggccgggga gcccatggcg tacagtcaag gaggcggcaa aaaaaaagtc 240
tgctactact acgacggtga tattggaaat tattattatg gacagggtca tcccatgaag 300 
cctcatagaa tccgcatgac ccataacttg ctgttaaatt atggcttata cagaaaaatg 360 
gaaatatata ggccccataa agccactgcc gaagaaatga caaaatatca cagtgatgag 420 
tatatcaaat ttctacggtc aataagacca gataacatgt ctgagtatag taagcagatg 480 
catatattta atgttggaga agattgtcca gcgtttgatg gactctttga gttttgtcag 540
ctctcaactg gcggrtcagt tgctggagct gtgaagttaa accgacaaca gactgatarg 600 
gctgttaatt gggctggagg attacatcat gctaagaaat acgaagcatc aggattctgt 660 
tacgttaatg atattgtgct tgccatcctt gaattactaa agtatcatca gagagtctta 720 
tatattgata tagatattca tcatggtgat ggtgttgaag aagcttttta tacaacagat 780 
cgtgtaatga cggtatcatt ccataaatat ggggaatact ttcctggcac aggagacttg 840
agggatattg gtgctggaaa aggcaaatac tatgctgtca attttccaat gtgtgatggt 900 
atagatgatg agtcatatgg gcagatattt aagcctatta tctcaaaggt gatggagatg 960 
tatcaaccta gtgctgtggt attacagtgt ggtgcagact cattatctgg tgatagactg 1020 
ggttgtttca atctaacagt caaaggtcat gctaaatgtg tagaagttgt aaaaactttt 1080 
aacttaccat tactgatgct tggaggaggt ggctacacaa tccgtaatgt tgctcgatgt 1140 
tggacatatg agactgcagt tgcccttgat tgtgagattc ccaatgagtt gccatataat 1200 
gattactttg agtattttgg accagacttc aaactgcata ttagtccttc aaacatgaca 1260 
aaccagaaca ctccagaata tatggaaaag ataaaacagc gtttgtttga aaatttgcgc 1320 
atgttacctc atgcacctgg tgtccagatg caagctattc cagaagatgc tgttcatgaa 1380 
gacagtggag atgaagatgg agaagatcca gacaagagaa tttctattcg agcatcagac 1440
aagcggatag cttgtgatga agaattctca gattctgagg atgaaggaga aggaggtcga 1500 
agaaatgtgg ctgatcataa gaaaggagca aagaaagcta gaattgaaga agataagaaa 1560 
gaaacagagg acaaaaaaac agacgttaag gaagaagata aatccaagga caacagtggt 1620 
gaaaaaacag ataccaaagg aaccaaatca gaacagctca gcaacccctg aatttgacag 1680 
tctcaccaat ttcagaaaat cattaaaaag aaaatattga aaggaaaatg ttttcttttt 1740
gaagacttct ggcttcattt tatactactt tggcatggac tgtatttatt ttcaaatggg 1800 
actttttcgt ttttgttttt ctgggcaagt tttattgtga gattttctaa ttatgaagca 1860 
aaatttcttt tctccaccat gctttatgtg atagtattta aaattgatgt gagttattat 1920 
gtcaaaaaaa ctgatctatt aaagaagtaa ttggcctttc tgagctgaaa aaaaaaaaaa 1980 
aaaag 1985 

<210> 77 
<211> 476 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 77 

ccaccctcct ccccctcccc cggccacttc gctaacttgg tggctgttgt gatgcgtatt 60 
cctgtagatc cgagcaccag ccggcgcttc agccccccct ccagcagcct gcagcccggc 120 
aaaatgagcg acgtgagccc ggtggtggct gcgcaacagc agcagcaaca gcagcagcag 180 
caacagcagc agcagcagca gcaacagcag cagcagcagc aggaggcggc ggcggcggct 240 
gcggcggcag cggcggctgc ggcggcggca gctgcagtgc cccggttgcg gccgccccac 300 
gacaaccgca ccatggtgga gatcatcgcc gaccacccgg ccgaactcgt ccgcaccgac 360 
agccccaact tcctgtgctc ggtgctgccc tcgcactggc gctgcaacaa gaccctgccc 420 
gtggccttca aggtaagagg ctaccccgcc ccccgccccc ggccgggagc ggcgga 476

<210> 78
<211> 24
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: DNA Primer
<400> 78

gcattttgga tccgcctttt catg 24

<210> 79
<211> 22
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: DNA Primer
<400> 79

gttgtgtgct gcagattgtt cc 22

<210> 80
<211> 21
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: DNA Primer
<400> 80

gaaaaatggg gatccgaggt g 21

<210> 81
<211> 20
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: DNA Primer
<400> 81

gcaggagaat tccgtccatg 20

<210> 82 
<211> 5 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> (3) 

<223> Can be any amino acid 
<400> 82 

Trp Ser Xaa Trp Ser 
1 5 



<210> 83 
<211> 6 
<212> PRT 

<213> Homo sapiens 
<400> 83 

Cys Ser Val Thr Cys Gly 
1 5 



<210> 84 
<211> 5 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> (4) 

<223> Can be any amino acid 



<400> 84 

Gly Cys Gln Xaa Arg
1 5

<210> 85 

<211> 733 

<212> DNA 

<213> Homo sapiens 

<400> 85 

gggatccgga gcccaaatct tctgacaaaa ctcacacatg cccaccgtgc ccagcacctg 60
aattcgaggg tgcaccgtca gtcttcctct tccccccaaa acccaaggac accctcatga 120
tctcccggac tcctgaggtc acatgcgtgg tggtggacgt aagccacgaa gaccctgagg 180
tcaagttcaa ctggtacgtg gacggcgtgg aggtgcataa tgccaagaca aagccgcggg 240
aggagcagta caacagcacg taccgtgtgg tcagcgtcct caccgtcctg caccaggact 300
ggctgaatgg caaggagtac aagtgcaagg tctccaacaa agccctccca acccccatcg 360
agaaaaccat ctccaaagcc aaagggcagc cccgagaacc acaggtgtac accctgcccc 420
catcccggga tgagctgacc aagaaccagg tcagcctgac ctgcctggtc aaaggcttct 480
atccaagcga catcgccgtg gagtgggaga gcaatgggca gccggagaac aactacaaga 540
ccacgcctcc cgtgctggac tccgacggct ccttcttcct ctacagcaag ctcaccgtgg 600
acaagagcag gtggcagcag gggaacgtct tctcatgctc cgtgatgcat gaggctctgc 660
acaaccacta cacgcagaag agcctctccc tgtctccggg taaatgagtg cgacggccgc 720
gactctagag gat 733

<210> 86
<211> 86
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: DNA Primer
<400> 86 

gcgcctcgag atttccccga aatctagatt tccccgaaat gatttccccg aaatgatttc 60 
cccgaaatat ctgccatctc aattag 86 

<210> 87 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA Primer
<400> 87 

gcggcaagct rtttgcaaag cctaggc 27 

<210> 88 
<211> 271 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR Fragment
<400> 88 

ctcgagattt ccccgaaatc tagatttccc cgaaatgatt tccccgaaat gatttccccg 60 
aaatatctgc catctcaatt agtcagcaac catagtcccg cccctaactc cgcccatccc 120 
gcccctaact ccgcccagtt ccgcccattc tccgccccat ggctgactaa ttttttttat 180 
ttatgcagag gccgaggccg cctcggcctc tgagctattc cagaagtagt gaggaggctt 240 
ttttggaggc ctaggctttt gcaaaaagct t 271 
<210> 89

<211> 32 

<212> DNA 

<213> Homo sapiens 



<400> 89

gcgctcgagg gatgacagcg atagaacccc gg 32



<210> 90 

<211> 31 

<212> DNA 

<213> Homo sapiens 



<400> 90

gcgaagcttc gcgactcccc ggatccgcct c 31



<210> 91 

<211> 12 

<212> DNA 

<213> Homo sapiens 



<400> 91

ggggactttc cc 12



<210> 92 

<211> 73 

<212> DNA 

<213> Homo sapiens 

<400> 92 

gcggcctcga ggggactttc ccggggactt tccggggact ttccgggact ttccatcctg 60
ccatctcaat tag 73



<210> 93 
<211> 27 
<212> DNA 
<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: PCR Fragment
<400> 93 

gcggcaagct ttttgcaaag cctaggc 27 



<210> 94 

<211> 652 

<212> DNA 

<213> Homo sapiens 



<400> 94 

ggcataagat cacactttag ttcagagaca catttgcata aatacttgaa atggatccac 60
ccctgcaggt ggcagcctga gaacatggcg ctgcaggggg accagggcag cgtctggttc 120
aggtggacga acagcggtgc catcacgtgg tgcttgccca tgggcccgaa gagccgtgtg 180
cagggcttgg agtcgtcgtg gggcatgctg aggacgtgcc ctagttcatg ggccagggtg 240
tgggccgcct ggagcccctc atcctcgatc acggagcagc ttttgttggg gtcacaaatg 300
gtcccgatgt ctgccacacc cagggtgtca cacagcccct cctgcccaca gaagttctgt 360
ctggtgagca ggatggccgt gtcgtagtgc tctgggtggc ggtcgctggg ctggttgaaa 420
cgccgctgcc agttgcagaa gttacgcagt gtaagccccc cattgtcgga cacctctggg 480
ccccattttt catcttctac gatcagcact tttaccacca tcangttgat ggaautcttg 540
atgctggggt gcttgtagaa tcgggcttgc cacgaaaatt aacctcagga tgtggttctg 600
caggtcggcc cgtaaagggc gccatggacg catcggccac caacagcgtt tc 652



<210> 95 

<211> 716 

<212> CDNA 

<213> Homo sapiens 



<400> 95 

taagtttgct agtcctttgc aaacagactg acgctgagtg tcctgtctga gtcaataagt 60
gcacttttac cttttaacct atgccctcta cttgaacccg agcaaggtcc agtccactgg 120
acagttgatg atagggtctg ccgccccata ccctctcctc ttccccctta ggaatttgtg 180
cagtactgga ggggttgcgg caatgggagg cctgggtggg ccgtgctgcc ttgatatggc 240
caagggaccc agtcaccaca gtggagaccc ttgtctgcac ctcagtaccg catgtccagg 300
agcacaagac tggcccctgc ccccctgaat cacagggggc acagctggct ttcgcagggc 360
ttggcatcct cgggtttcag agccttgttg caggtggcag aggcctggcc ggaggggtcc 420
ctgcactcta cagttcgcct ctgccagccg gccccgcagg tgctagagca ctcagaccag 480
tcccccagca cccactgtgc gtggagcagc ggctggatga tgttggtggt tgctctctct 540
ttgctgctct gcatgctaaa agtcacgtca ttaggaanca aagaaggtgt atttgacttt 600
ttggggggaa gaacctcgcc caggactgtc aggagctgca ctgtcagaag gctctgcnaa 660
ggcccngaag ctctgcangc gctccagggt ggcgatggag ccgtgtactt caggat 716



<210> 96 

<211> 543 

<212> DNA 

<213> Homo sapiens 



<400> 96 

ggcataagat cacactttag ttcagagaca catttgcata aatacttgaa atggatccac 60 
ccctgcaggt ggcagcctga gaacatggcg ctgcaggggg accagggcag cgtctggttc 120 
aggtggacga acagcggtgc catcacgtgg tgcttgccca tggcctcgaa gagccgtgtg 180 
cagggcttgg agtcgtcgtg gggcatgctg aggacgtgcc ctagttcatg ggccagggtg 240 
tgggccgctg gagccctcat cctcgatcac ggagcagctt ttgttggggt cacaaatggt 300 
cccgatgtct gccacaccca gggtgtcaca cagcccctcc tgcccacaga agttctgtct 360 
ggtgagcagg atggccgtgt cgtagtgctc tgggtggcgg tcgctgggct ggttgaaacg 420 
ccgctgccag ttgcagaagt tacgcagtgt aaggccccca ttgtcggaca gctctggggc 480 
ccatttttca tcttctacga tcagcacttt taaccacatc aggttgatgg aattcttgat 540 
gcc 543 



<210> 97 
<211> 377 
<212> DNA 
<213> Mus musculus



<400> 97 

gcaaagtgcc accacccttc ggatccaaaa ctagaagcaa gaggtttgtg tccgaggctc 60 
gcttcgtgga aacacttctg gtggctgatg cgtccatggc tgccttctat gggaccgacc 120 
tgcagaacca catcctcacg gtgatgtcaa tggcagcccg aatctacaag cacccgagca 180 
tcaagaactc cgtcaacctt gtggtggtga aagtgctaat agtggaagag gaaggatggg 240
gcccggaggt gtcggacaac ggggggctca cactgcgcaa cttctgcagc tggcaacggc 300 
gtttcaacaa gcccagtgac cgccacccgg agcactatga cactgccatc ttgttcacca 360 
gacagaactt ctgtggg 377 



<210> 98 
<211> 432 
<212> DNA 

<213> Rattus norvegicus 



<400> 98 






ctaaagtaca gtggttccat ggccaccctg gagcggctgc anagcttcca agccctccct 60 
gagcctctta cagtacagct cctgactgtg tctggtgagg tcttccctcc aaaagtcaaa 120 
tataccttct tcgtccccaa tgacacggac ttcaacgtgc agagtagcaa agaaagagca 180 
agcaccaaca tcattcagtc cttgccctat gcanagtggg tgctggggga ctggtctgaa 240 
tgtccaagca catgtggagg tggctggcag cggcggactg tggaatgcag ggacccctca 300 
ggtcaggcct ctgacacctg tgatgaggct ctgaaacctg aggatgccaa gccctgtgga 360 
agccagccat gtctcctctg atccccttgg tggacatgtc taaggcttat ggatttgggc 420 
tactggcgtt tt 432 



<210> 99 

<211> 354 

<212> DNA 

<213> Mus musculus 



<400> 99 

caaagtgcac cacccttcgg atccaaaact agaagcaaga ggtttgtgtc cgaggctcgc 60
ttcgtggaaa cacttctggt ggctgatgcg tccatggctg ccttctatgg gaccgacctg 120
cagaaccaca tcctcacggt gatgtcaatg gcagccacga atctacaagc acccgagcat 180
caggaactcc gtcaaccttg tggtggtgaa agtgctaata gtggaagagg aaggatgggg 240
cccggagtgt cggacaacgg ggggctcaca ctgcgcaact tctgcagctg gcaacggcgt 300
ttcaacaagc ccagtgaccg ccacccggag cactatgaca ctgccatctt gttc 354



<210> 100 
<211> 389 
<212> DNA 
<213> Homo sapiens 

<400> 100 

ttgtgcgcag aagacactgg ccctggggcc tgggttgagt tcaaaaccaa aagaaagaag 60
aaaagtgctg taaattcggg atttctccac cggatgctcc tgcttccgca tgggtgtcac 120 
ctccatgccg ttcctncctc tttctaggga aaagcttcag ggagcagcag tgtgagaagt 180 
ataatgccta caattacact gacatggacg ggaatctcct gcagtgggtc cccaagtatg 240 
ctggggtgtc cccccgggac cgcctggcaa gttgttctgc cgagcccggg ggaggagcga 300 
gttcaaagtg ttcgaggcca aggtgagaat caccctgggg gacttcagat ccagagatgg 360 
ggggagggaa ggtcggcctg ttccccaca 389



<210> 101 
<211> 305 
<212> DNA 
<213> Homo sapiens 






<400> 101 

catttaagtt tgctagtcct ttgcaaacag actgacgctg agtgtcctgt ctgagtcaat 60
aagtgcactt ttacctttta acctatgccc tctacttgaa cccgagcaag gtccagtcca 120
ctggacangt tgatgatagg gtctgncgcc ccataccctc tcctcttccc ccttaggaat 180
ttgtgcagta ctggaggggt tgcggcaatg ggaggcctgg gtgggccgtg ctgccttgat 240
atggccaagg gacccagtca ccacagtgga gacccttgtc tgcacctcag taccgcatgt 300
ccagg 305



<210> 102 
<211> 152 
<212> DNA 
<213> Homo sapiens 



<400> 102 

atcgtagaag atgaaaaatg gggcccagag gtgtccgaca atggggggct tacactgcgt 60
aacttctgca actggcagcg gcgtttcaac cagcccagcg accgncaccc agagcactac 120
gncacggcca tcctnctcac cagacagaac tt 152



<210> 103 
<211> 632 
<212> DNA 
<213> Homo sapiens 



<400> 103 

tttaataata ataatgcccg gggctttatt atgctgtatc actgctcaga ggttaataat 60
cctcactaac tatcctatca aatttgcaac tggcagttta ctctgatgat tcaactcctt 120
ttctatctac ccccataatc ccaccttact gatacacctc actggttact ggcaagatac 180
gctggatccc tccagccttc ttgctttccc tgcaccagcc cttcctcact ttgccttgcc 240
ctcaaagcta acaccactta aaccacttaa ctgcattctg ccattgtgca aaagtctatg 300
aaatgtttag gtttctttaa aggatcacag ctctcatgag ataacacccc tccatcatgg 360
gacagacact tcaagcttct ttttttgtaa Gccttcccac aagtcttaga acatgatgac 420
cactccccca gctgccactg ggggcaggga tggtctgcac aaggtctggt gctggctggc 480
ttcacttcct ttgcacactc ggaagcaggc tgtccattaa tgtctcggca ttctaccagt 540
cttctctgcc aacccaattc acatgactta gaacattcgc cccactcttc aatgacccat 600
gctgaaaaag tggggatagc attgaaagaa tc 632



<210> 104 
<211> 519 
<212> DNA 
<213> Homo sapiens 






<400> 104 

tttttttcta aacttgtaat agatgtaaca aaagaaataa taataataat gcccggggct 60 
ttattatgct atatcacrgc tcagaggtta ataatcctca ctaactatcc tatcaaattt 120 
gcaactggca gtttactctg atgattcaac tccttttcta tctaccccca taatcccacc 180 
ttactgatac acctcactgg ttactggcaa gatacgctgg atccctccag ccttcttgct 240 
ttccctgcac cagcccttcc tcactttgcc ttgccctcaa agctaacacc acttaaacca 300 
cttaactgca ttctgccatt gtgcaaaagt ctatgaaatg tttaggtttc tttaaaggat 360 
cacagctctc atgagataac acccctccat catgggacag acacttcaag cttctttttt 420 
tgtaaccctt cccacaggtc ttagaacatg atgaccactc ccccagctgc cactgggggc 480
agggatgtct gcacaagggc tggtgctggc tgcccggac 519 

<210> 105 
<211> 475 
<212> DNA 
<213> Homo sapiens 

<400> 105 

gagtcatgat gcgatcacaa ccagctttta cacactgtcc ttgcacacag acagaggtgg 60 

aatctgggct acatggagta ccatctacaa ccttgggctg caaaacgaag aagtagccaa 120 

tgcctttggc ttggcagatg agcttgcacc tgtcctttgg tgagacgcca gcgtacttgg 180 

gaatccattc caccgcaggc ccactcccaa aggaagcttt tgaaaactcg ttgtgtgctt 240 

cacattgttc ctctctaaag gtttttccat tattgtctgg acagtcctca aggttacagg 300 

atctgtagcg cactcgtttg ccttcacagt acttccctcc attctttggg actgggttgt 360 

cacattccct catcgtgtac tggactcctc caccgcacgt tctcgaacag tctccccaag 420 

gcccccacat tccccagctt ccatgaaaag gcgtatcaaa atgctttctg tcggt 475 

<210> 106 
<211> 455 
<212> DNA 
<213> Homo sapiens 

<400> 106 

aataataata atgcccgggg ctttattatg ctgtatcact gctcagaggt taataatcct 60 

cactaactat cctatcaaat ttgcaactgg cagtttactc tgatgattca actccttttc 120 

tatctacccc cataatccca ccttactgat acacctcact ggttactggc aagatacgct 180 

ggatccctcc agccttcttg ctttccctgc accagccctt cctcactttg ccttgccctc 240 

aaagctaaca ccacttaaac cacttaactg cattctgcca ttgtgcaaaa gtctatgaaa 300 

tgtttaggtt tctttaaagg atcacagctc tcatgagata acacccctcc atcatgggac 360 

agacacttca agcttctttt tttgtaaccc ttcccacagg tcttagaaca tgatgaccac 420 

tcccccagct gccactgggg gcagggatgg tctgg 455 






<210> 107 
<211> 515 
<212> DNA 
<213> Homo sapiens 



<400> 107 

aacccttccc acaggtctta gaacatgatg accactcccc cagctgccac tgcggggcag 60
ggatggtctg cacaaggtct ggtgctggct ggcttcactt cctttgcaca ctcggaagca 120
ggctgtccat taatgtctcg gcattcttcc agtcttctct gccaacccaa ttcacatgac 180
ttagaacatt cgccccactc ttcaatgacc catgctgaaa aagtggggat agcattgaaa 240
gattccttct tcttctttac gaagtaggtg tatttaattt taggtcgaag ggcattgcca 300
cagtaagaac ctggatggtc aagggctctt tggagcaggc taaagctgcg aattctttcc 360
aatgccgcag aggagccgct gtacctcaag acaacacctt tgtacataat gtcttgctct 420
aaggtggaca aagtgtagtc accataaaga atatatgtgc catcagcagc ttrtgatggc 480
aggaagctgt cattgttctt ggatccctct gttcc 515



<210> 108 
<211> 359 
<212> DNA 
<213> Homo sapiens 



<400> 108 

acttcgtaaa gaagaagaag gaatctttca atgctatccc cactttttca gcatgggtca 60
ttgaagagtg gggcgaatgt tctaagtcat gtgaattggg ttggcagaga agactggtag 120
aatgccgaga cattaatgga cagcctgctt ccgagtgtgc aaaggaagtg aagccagcca 180
gcaccagacc ttgtgcagac catccctgcc cccagtggca gctgggggaa gtggtcatca 240
tgttctaaga cctgcgggaa gggttacaaa aaaagaagct ttgaagtgtc ttgtcccatg 300
atggaggggt gttatctcat tgagagctgt gatcctttaa agaaacctaa acatttcat 359



<210> 109 
<211> 320 
<212> DNA 
<213> Homo sapiens 



<400> 109 

cagagaacat tcgccccact cttcaatgac ccatgctgaa aaagtgggga tagcattgaa 60
agattccttc ttcttcttta cgaagtaggt gtatttaatt ttaggtcgaa gggcattgcc 120
cacagtaaga acctggatgg tcaagggctc tttgagaggg cuaaagctgc gaattctttc 180
caatgccgca gaggagccgc tgtacctcaa gacaacacct ttgtacataa tgtcttgctc 240
taaggtggac aaagtgtagt caccattaag aatatatgtg ccatcagcag ctttgatggc 300
aagaaagctg cccttgttcc 320






<210> 110 
<211> 316 
<212> DNA 
<213> Homo sapiens 



<400> 110 

aatgccgaga cattaatgga cagcctgctt ccgagtgtgc aaaggaagtg aagccagcca 60
gcaccagacc ttgtgcagac catccctgcc cccagtggca gctgggggag tggtcatcat 120
gttctaagac ctgtgggaag ggttacaaaa aaagaagctt gaagtgtctg tcccatgatg 180
gaggggtgtt atctcatgag agctgtgatc ctttaaagaa acctaaacat ttcatagact 240
tttgcacaat ggcagaatgc agttaagtgg tttaagtggt gttagctttg agggcaaggc 300
aaagtgagga agggct 316



<210> 111 
<211> 318 
<212> DNA 
<213> Homo sapiens 



<400> 111 

agantnccga gacattaatg gacagcctgc ttccgagtgt gcaaaggaag tgaagccagc 60
cagcaccaga ccttgtgcag accatccctg cccccagtgg cagctggggg agtggtcatc 120
atgttctaag acctgtggga agggttacaa aaaaagaagc ttgaagtgtc tgtcccatga 180
tggaggggtg ttatctcatg agagctgtga tcctttaaag aaacctaaac atttcataga 240
cttttgcaca atggcagaat ncagttaagt ggtttaagtg gtgttagctt tgagggcaag 300
gcaaagtgag gaagggct 318



<210> 112 
<211> 314 
<212> DNA 
<213> Homo sapiens 



<400> 112 

ttttttttct aaacttgtaa tagatgtaac aaaagaaata ataataataa tgcccggggc 60
tttattatgc tatatcactg ctcagaggtt aataatcctc actaactatc ctatcaaatt 120
tgcaactggc agtttactct gatgattcaa ctccttttct atctaccccc ataatcccac 180
cttactgata cacctcactg gttactggca agatacgctg gatccctcca gccttcttgc 240
tttccctgca ccagcccttc ctcactttgc cttgccctca aagctaacac cacttaaacc 300
acttaactgc attc 314



<210> 113 






<211> 316 
<212> DNA 
<213> Homo sapiens 



<400> 113 

aaggaatcct tcaatgctat ccccactgtt tcagcatggg tcattgaaga gtggggcgaa 60 
tgttctaagt catgtgaatt gggttggcag aaaagacttg tagaatgccg agacattaat 120 
ggacagcctg cgtccgagtg tgcaaaggaa gtgaagccag ccagcaccag accttgtgca 180 
gaccatccct gcccccagtg gcagctgggg ggagtggtca tcatgttcta agacctgtgg 240 
gaaggggtac aaaaaaagag gcgtgaagtg tctgtcccat gatggagggg tttatctcat 300 
gagaactgtg atcctt 316 



<210> 114 
<211> 265 
<212> DNA 
<213> Homo sapiens 



<400> 114 

agcagtttan ncctntcaaa gagcccttga ccatccaggt tcttactgtg ggcaatgccc 60 
ttcgacctaa aattaaatac acctacttcg taaagangaa gaaggaatct ttcaatgcta 120 
tccccacttt ttcagcatgg gtcattgaag agtggggcga atgttctaag tcatgtgaat 180 
tgggttggca gagaagactg gtagaatgcc gagacattaa tggacagcct ncttccgagt 240 
gtgcaaagna agtgaagcca gccag 265



<210> 115 
<211> 334 
<212> DNA 
<213> Mus musculus



<400> 115 

cgtttgtgga ggaaacggtt ccacatgcaa gaagatgtca ggaatagtca ctagtacaag 60 
acctgggtat catgacattg tcacaattcc tgctggagcc accaacattg aagtgaaaca 120 
tcggaatcaa agggggtcca gaaacaatgg cagctttctg gctattagag ccgctgatgg 180 
tacctatatt ctgaatggaa acttcactct gtccacacta gagcaagacc tcacctacaa 240 
aggtactgtc ttaaggtaca gtggttcctc ggctgcgctg gagagaatcc gcagctttag 300 
tccactcaaa gaacccttaa ccatccaggt tctt 334 



<210> 116 
<211> 528 
<212> DNA 
<213> Mus musculus






<400> 116 

agaattcctg gatgatggtc atggtaattg cttccgtggt aggtctagca aacaattacc 60
atgaccatca tccaggaatt ctgtgatggt ggctgacgtg catttggacc agggcttgga 120
tgcatcgatg ctggtaagga ttgaagacat taaacgcttg tcttctgtag taccgaagtt 180
ctcttcacag aatttggaat cgtcatgaga 240
cacagtgaag gctgcatgga ggccatcatc ttcaatcact gcacagctgc gctccggaga 300
acatatggtc ccaacgtctg ccattcccag ggtgtcacat gaatgatgcc cacataaatc 360
ctctcgggtg aacaggatgg ctgcatcgta gtgctcttcg tgatcatccc ctagctggtt 420
atgttggtgc tgccatttgc aaaagttctt gagggtcgtg gccgcattct tgctcacctc 480
cagactcgtg tccttgtccg tcagcaccac caccttcacc accgccag 528

<210> 117
<211> 438
<212> DNA
<213> Homo sapiens

<400> 117

atttgatagg atagttagtg aggattatta acctctgagc agtgatatag cataataaag 60
ccccgggcat tattattatt atttcttttg ttacatctat tacaagttta gaaaaaacaa 120
agcaattgtc aaaaaaagtt agaactatta caacccctgt ttcctggtac ttatcaaata 180
cttagtatca tgggggttgg gaaatgaaaa gtaggagaaa agtgagattt tactaagacc 240
tgttttactt tacctcacta acaatggggg gagaaaggag tacaaatagg atctttgacc 300
agcactgttt atgggctgct atgggtttca gaggaatgtt tatacattat ttctacccga 360
ggatttaaaa cttcagattg ttccaaccng gaggggaagg gcttccggcc aacgtggaat 420
taaccggcaa tnggcctt 438

<210> 118
<211> 455
<212> DNA
<213> Homo sapiens

<400> 118

atttgatagg atagttagtg aggattatta acctctgagc agtgatatag cataataaag 60
ccccgggcat tattattatt atttcttttg ttacatctat tacaagttta gaaaaaacaa 120
agcaattgtc aaaaaaagtt agaactatta caacccctgt ttcctggtac ttatcaaata 180
cttagtatca tgggggttgg gaaatgaaaa gtaggagaaa agtgagattt tactaagacc 240
tgttttactt tacctcacta acaatggggg gagaaaggag tacaaatagg atcttugacc 300
agcactgttt atggctgcta tggtttcaga gaatgtttat acatratttc taccgaggat 360
taaaacttcc agattgtttc aacatggaga ggaaaggctc aggcaacgtg gaaataacgc 420
aaatgggctt cctcttttcc tttttgggac cntct 455






<210> 119 
<211> 380 
<212> DNA 
<213> Homo sapiens 



<400> 119 

aatttgatag gatagttagt gaggnttatt aacctctgag cagtgatata gcataataaa 60
gccccgggca ttattattat tattnctttt gttacatcta ttacaagttt agaaaaaaca 120
aagcaattgt caaaaaaagt tagaactatt acaacccctg tttcctggta cttatcaaat 180
acttagtatn atgggggttg ggaaatgaaa agtaggagaa aagtgagatt ttactaagac 240
ctgttttact ttacctcact aacaatgggg ggagaaagga gtacanatag gatctttgac 300
cagcactgtt tatggctgct atggtttcag aggaatgttt atacattatt tctaccgaga 360
nttaaaactt cagattgttc 380



<210> 120 
<211> 199 
<212> DNA 
<213> Mus musculus 



<400> 120 

caatggcagc ttgctggcta taatagccgc tgatggtacc tatatactga atggaaactt 60 
cactctgtcc acactagagc aagacctcac ctacgaatgt actgtcttaa ggtacagtgg 120 
ttcctcggct gcgcaggaaa gagtccgcag ctttagtcca ctcaaataac ccttaaccat 180
ccaggttctt atggtagga 199 



<210> 121 
<211> 439 
<212> DNA 
<213> Homo sapiens 



<400> 121 

atttaacctc tgagcagtga tatagcataa taaagccccg ggcattatta ttattatttc 60 
ttttgttaca tctattacaa gtttagaaaa aacaaagcaa ttgtcaaaaa aagttagaac 120 
tattacaacc cctgtttcct ggtacttatc aaatacttag tatcatgggg gttgggaaat 180 
gaaaagtagg aggaaagnng agnttttact aagacctgtt ttacctttac ctcactaaca 240 
atggggggag aaaggagtac aaataggatc tttgaccagc actgtttatg gctgctatgg 300 
tttcagagaa tgtttataca ttatttctac cgagaattaa aacttcagat tgttcaacat 360 
ggagagaaag gctcagcaac gtggaaataa cgcaaatggg cttccccctt tccctttttt 420 
gggaccatct caggtcctt 439

<210> 122
<211> 471 
<212> DNA 
<213> Homo sapiens 



<400> 122 

cagagtaaac tgccagttgc aaatttgata ggatagttag tgaggattat taacctctga 60
gcagtgatat agcataataa agccccgggc attattatta ttattatttc ttttgttaca 120
tctattacaa gtttagaaaa aacaaagcaa ttgtcaaaaa aagttagaac tattacaacc 180
cctgtttcct ggtacttatc aaatacttag tatca-cgggg gttgggaaat gaaaagtagg 240
agaaaagtga gattttacta agacctgttt tacttttcct cactaacaat ggggggagaa 300
aggagtacaa ataggatctt tgaccagcac tgtttatggc tgctatggtt tcagagaatg 360
tttatacatt atttctaccc gagaattaaa acttcagatt ggttcaacat gagagaaagg 420
ctccagcaac gtgaaattaa cgccaatggc ttcctccttc ccttttttgg a 471

<210> 123
<211> 424
<212> DNA
<213> Homo sapiens

<400> 123

cgtgaggatt attaacctct gagcagtgat atagcatant aaagccccgg nattattatt 60
attatttctt ttgttacatc tattacaagt ttagaaaaaa caaagcaatt gtcaaaaaaa 120
gttagaacta ttacaacccc tgtttcctgg tacttatcaa atacttagta tcatgggggt 180
tgggaaatga aaagtaggag aaaagtgaga ttttactaag acctgtttta ctttacctca 240
ctaacaatgg ggggagaaag gagtacaaat aggatctttg accagcactg tttatggctg 300
ctaatggttt cagagaatgt ttatacatta tttctacccg agaattaaaa cttcagattg 360
ttcaacctga gagaaaggct cagcaacgtg aaatnacgcc aatggcttcc tctttccctt 420
tttg 424



<210> 124 
<211> 458 
<212> DNA 
<213> Homo sapiens 



<400> 124 

tacatctatt acaagtttag aaaaaacaaa gcaattgtca aaaaaagtta gaactattac 60
aacccctgtt tcctggtact tatcaaatac ttagtatcat gggggttggg aaatgaaaag 120
taggagaaaa gtgagatttt actaagacct gttttacttt acctcactaa caatgggggg 180
agaaaggagt acaaatagga tctttgacca gcactgttta tggctgctat ggtttcagag 240
aatgtttata cattatttct accgagaatt aaaacttcag attgttcaac atgagagaaa 300
ggctcagcaa cgtgaaataa cgcaaatggc ttcctctttc cttttttgga ccacagccag 360
ccttggtctc cttgcagtgg ctacatgatt acatcatttc tggataatag tcatggggaa 420 
tgtttgatgg acaagctcag aatcccatac agntccca 458 

<210> 125 
<211> 968 
<212> PRT 

<213> Homo sapiens 
<400> 125 

Met Gln Arg Ala Val Pro Glu Gly Phe Gly Arg Arg Lys Leu Gly Ser
1 5 10 15

Asp Met Met Gly Asn Ala Glu Arg Ala Pro Gly Ser Arg Ser Phe Gly
20 25 30

Pro Val Pro Thr Leu Leu Leu Leu Ala Ala Ala Leu Leu Ala Val Ser
35 40 45

Asp Ala Leu Gly Arg Pro Ser Glu Glu Asp Glu Glu Leu Val Val Pro
50 55 60

Glu Leu Glu Arg Ala Pro Gly His Gly Thr Thr Arg Leu Arg Leu His
65 70 75 80

Ala Phe Asp Gln Gln Leu Asp Leu Glu Leu Arg Pro Asp Ser Ser Phe
85 90 95

Leu Ala Pro Gly Phe Thr Leu Gln Asn Val Gly Arg Lys Ser Gly Ser
100 105 110

Glu Thr Pro Leu Pro Glu Thr Asp Leu Ala His Cys Phe Tyr Ser Gly
115 120 125

Thr Val Asn Gly Asp Pro Ser Ser Ala Ala Ala Leu Ser Leu Cys Glu
130 135 140

Gly Val Arg Gly Ala Phe Tyr Leu Leu Gly Glu Ala Tyr Phe Ile Gln
145 150 155 160

Pro Leu Pro Ala Ala Ser Glu Arg Leu Ala Thr Ala Ala Pro Gly Glu
165 170 175

Lys Pro Pro Ala Pro Leu Gln Phe His Leu Leu Arg Arg Asn Arg Gln
180 185 190

Gly Asp Val Gly Gly Thr Cys Gly Val Val Asp Asp Glu Pro Arg Pro
195 200 205

Thr Gly Lys Ala Glu Thr Glu Asp Glu Asp Glu Gly Thr Glu Gly Glu
210 215 220

Asp Glu Gly Pro Gln Trp Ser Pro Gln Asp Pro Ala Leu Gln Gly Val
225 230 235 240

Gly Gln Pro Thr Gly Thr Gly Ser Ile Arg Lys Lys Arg Phe Val Ser
245 250 255

Ser His Arg Tyr Val Glu Thr Met Leu Val Ala Asp Gln Ser Met Ala
260 265 270

Glu Phe His Gly Ser Gly Leu Lys His Tyr Leu Leu Thr Leu Phe Ser
275 280 285

Val Ala Ala Arg Leu Tyr Lys His Pro Ser Ile Arg Asn Ser Val Ser
290 295 300

Leu Val Val Val Lys Ile Leu Val Ile His Asp Glu Gln Lys Gly Pro
305 310 315 320

Glu Val Thr Ser Asn Ala Ala Leu Thr Leu Arg Asn Phe Cys Asn Trp
325 330 335

Gln Lys Gln His Asn Pro Pro Ser Asp Arg Asp Ala Glu His Tyr Asp
340 345 350

Thr Ala Ile Leu Phe Thr Arg Gln Asp Leu Cys Gly Ser Gln Thr Cys
355 360 365

Asp Thr Leu Gly Met Ala Asp Val Gly Thr Val Cys Asp Pro Ser Arg
370 375 380

Ser Cys Ser Val Ile Glu Asp Asp Gly Leu Gln Ala Ala Phe Thr Thr
385 390 395 400

Ala His Glu Leu Gly His Val Phe Asn Met Pro His Asp Asp Ala Lys
405 410 415

Gln Cys Ala Ser Leu Asn Gly Val Asn Gln Asp Ser His Met Met Ala
420 425 430

Ser Met Leu Ser Asn Leu Asp His Ser Gln Pro Trp Ser Pro Cys Ser
435 440 445

Ala Tyr Met Ile Thr Ser Phe Leu Asp Asn Gly His Gly Glu Cys Leu
450 455 460

Met Asp Lys Pro Gln Asn Pro Ile Gln Leu Pro Gly Asp Leu Pro Gly
465 470 475 480

Thr Ser Tyr Asp Ala Asn Arg Gln Cys Gln Phe Thr Phe Gly Glu Asp
485 490 495

Ser Lys His Cys Pro Asp Ala Ala Ser Thr Cys Ser Thr Leu Trp Cys
500 505 510

Thr Gly Thr Ser Gly Gly Val Leu Val Cys Gln Thr Lys His Phe Pro
515 520 525

Trp Ala Asp Gly Thr Ser Cys Gly Glu Gly Lys Trp Cys Ile Asn Gly
530 535 540

Lys Cys Val Asn Lys Thr Asp Arg Lys His Phe Asp Thr Pro Phe His
545 550 555 560

Gly Ser Trp Gly Met Trp Gly Pro Trp Gly Asp Cys Ser Arg Thr Cys
565 570 575

Gly Gly Gly Val Gln Tyr Thr Met Arg Glu Cys Asp Asn Pro Val Pro
580 585 590

Lys Asn Gly Gly Lys Tyr Cys Glu Gly Lys Arg Val Arg Tyr Arg Ser
595 600 605

Cys Asn Leu Glu Asp Cys Pro Asp Asn Asn Gly Lys Thr Phe Arg Glu
610 615 620

Glu Gln Cys Glu Ala His Asn Glu Phe Ser Lys Ala Ser Phe Gly Ser
625 630 635 640

Gly Pro Ala Val Glu Trp Ile Pro Lys Tyr Ala Gly Val Ser Pro Lys
645 650 655

Asp Arg Cys Lys Leu Ile Cys Gln Ala Lys Gly Ile Gly Tyr Phe Phe
660 665 670

Val Leu Gln Pro Lys Val Val Asp Gly Thr Pro Cys Ser Pro Asp Ser
675 680 685

Thr Ser Val Cys Val Gln Gly Gln Cys Val Lys Ala Gly Cys Asp Arg
690 695 700

Ile Ile Asp Ser Lys Lys Lys Phe Asp Lys Cys Gly Val Cys Gly Gly
705 710 715 720

Asn Gly Ser Thr Cys Lys Lys Ile Ser Gly Ser Val Thr Ser Ala Lys
725 730 735

Pro Gly Tyr His Asp Ile Ile Thr Ile Pro Thr Gly Ala Thr Asn Ile
740 745 750

Glu Val Lys Gln Arg Asn Gln Arg Gly Ser Arg Asn Asn Gly Ser Phe
755 760 765

Leu Ala Ile Lys Ala Ala Asp Gly Thr Tyr Ile Leu Asn Gly Asp Tyr
770 775 780

Thr Leu Ser Thr Leu Glu Gln Asp Ile Met Tyr Lys Gly Val Val Leu
785 790 795 800

Arg Tyr Ser Gly Ser Ser Ala Ala Leu Glu Arg Ile Arg Ser Phe Ser
805 810 815

Pro Leu Lys Glu Pro Leu Thr Ile Gln Val Leu Thr Val Gly Asn Ala
820 825 830

Leu Arg Pro Lys Ile Lys Tyr Thr Tyr Phe Val Lys Lys Lys Lys Glu
835 840 845

Ser Phe Asn Ala Ile Pro Thr Phe Ser Ala Trp Val Ile Glu Glu Trp
850 855 860

Gly Glu Cys Ser Lys Ser Cys Glu Leu Gly Trp Gln Arg Arg Leu Val
865 870 875 880

Glu Cys Arg Asp Ile Asn Gly Gln Pro Ala Ser Glu Cys Ala Lys Glu
885 890 895

Val Lys Pro Ala Ser Thr Arg Pro Cys Ala Asp His Pro Cys Pro Gln
900 905 910

Trp Gln Leu Gly Glu Trp Ser Ser Cys Ser Lys Thr Cys Gly Lys Gly
915 920 925

Tyr Lys Lys Arg Ser Leu Lys Cys Leu Ser His Asp Gly Gly Val Leu
930 935 940

Ser His Glu Ser Cys Asp Pro Leu Lys Lys Pro Lys His Phe Ile Asp
945 950 955 960

Phe Cys Thr Met Ala Glu Cys Ser
965



